Gene Lcho_0233 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagLcho_0233 
Symbol 
ID6160552 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameLeptothrix cholodnii SP-6 
KingdomBacteria 
Replicon accessionNC_010524 
Strand
Start bp249060 
End bp250733 
Gene Length1674 bp 
Protein Length557 aa 
Translation table11 
GC content65% 
IMG OID641662977 
Productgeneral substrate transporter 
Protein accessionYP_001789273 
Protein GI171056924 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.00123481 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTCCAGCA TCCCGATCAA TGCGACCAAG AAGGCCGTTC CGCCGGCCAT GACACCCGAA 
GAGCGGAAGG TGATCTTCGC CTCCTCGCTG GGCACGGTGT TCGAGTGGTA CGACTTCTAC
CTCTACGGCT CGCTCGCAGC CATCATCGCC AAGCAGTTCT TCGCGGGACT GGATGCCGGC
TCGGCCTTCA TCTTCGCGCT GCTCGCCTTT GCCGCCGGCT TCATCGTGCG CCCGTTCGGC
GCGCTGGTGT TCGGTCGCCT GGGCGACATG ATCGGCCGCA AGTACACCTT CCTGGTGACG
ATCCTGATCA TGGGCCTGGC GACGTTCATC GTCGGCATCC TGCCCAACTA CGAGTCGATC
GGCGTGGCCG CGCCGGTGAT CCTGATCGCG CTGCGCATGC TGCAGGGCCT GGCGCTCGGC
GGTGAATACG GCGGTGCCGC CACCTACGTG GCCGAACACG CGCCGCACGG CAAGCGCGGC
GCCTACACGG CGTGGATCCA GACCACCGCG ACGCTGGGCC TGTTCCTGTC GCTGATGGTC
ATTCTCGGCA CCCGCACCGC GATCGGCGAA GCCGCCTTCG CCGACTGGGG CTGGCGCATC
CCGTTCATCG TCTCGATCGC CCTGCTGGCC ATCAGCGTGT GGATCCGCCT GTCGATGAAC
GAATCGCCCG CCTTCAAGAA GATGAAGGAA GAGGGCAAGA CCTCGAAGGC GCCGCTGTCG
GAGTCGTTCG GCCAGTGGAA GAACCTGAAG ATCGTGATCC TGGCGCTGAT CGGCCTGACC
GCCGGCCAGG CCGTTGTCTG GTACACGGGC CAGTTCTACG CGCTGTTCTT CCTGACGCAG
GCCCTCAAGG TCGACGGCGC CACCGCCAAC GTGCTGGTCG CTGCCTCGCT GGTGATCGGC
ACGCCGTTCT TCATCGTGTT CGGCGCGCTG TCCGACAAGA TCGGCCGCAA GCCCATCATC
ATGGCCGGCT GCCTGATCGC CGCGCTGACC TTCTTCCCGC TGTTCAAGGC GCTGACCCAG
GCCGCCAACC CGGACCTCGC GACGGCCCAG GCCAACGCCA AGGTCACCAT CACGGCCGAC
CCCAAGGAGT GCTCGTTCCA GTTCAACCCG ACCGGCACGA AGAAGTTCAC CAGCTCCTGC
GACATCGCCA AGCAGGTGCT GGCCGGCGCT TCGGTCAGCT ACGACAACAT CGAGGCCACC
GGCCCCGCCA AGATCACGGT GGGCACGACG GTCATCGAGG GCTACACCTC GGCCGGCCTG
GCGGCTGACG AAGCCAAGAA GAAGGACGCC GAGTTCAAGA AGGCCGTCGC CGATGCGCTC
AAGGCCGCCG GCTACCCGGC CAAGGCCGAT CCGGCCAAGG TCGACAAGCT GAAGATCATC
GTCATCCTGA CGATCCTCGT GATCTACGTG ACGATGGTCT ACGGCCCGAT CGCGGCGATG
CTGGTGGAGA TGTTCCCGAC CCGCATCCGC TACACCTCGA TGAGCCTGCC GTACCACATC
GGCAACGGCT GGTTCGGCGG CCTGCTGCCC ACCACCGCCT TCGCCATCGT GGCGCAGACC
GGCAACATGT ACAACGGCCT CTGGTATCCG ATCATCATCG CCGGCGCGAC CTTCGTCATC
GGCATGCTGT TCATCAAGGA AACCAAGGAC GTCGACATCT ACGCCGACGA CTGA
 
Protein sequence
MSSIPINATK KAVPPAMTPE ERKVIFASSL GTVFEWYDFY LYGSLAAIIA KQFFAGLDAG 
SAFIFALLAF AAGFIVRPFG ALVFGRLGDM IGRKYTFLVT ILIMGLATFI VGILPNYESI
GVAAPVILIA LRMLQGLALG GEYGGAATYV AEHAPHGKRG AYTAWIQTTA TLGLFLSLMV
ILGTRTAIGE AAFADWGWRI PFIVSIALLA ISVWIRLSMN ESPAFKKMKE EGKTSKAPLS
ESFGQWKNLK IVILALIGLT AGQAVVWYTG QFYALFFLTQ ALKVDGATAN VLVAASLVIG
TPFFIVFGAL SDKIGRKPII MAGCLIAALT FFPLFKALTQ AANPDLATAQ ANAKVTITAD
PKECSFQFNP TGTKKFTSSC DIAKQVLAGA SVSYDNIEAT GPAKITVGTT VIEGYTSAGL
AADEAKKKDA EFKKAVADAL KAAGYPAKAD PAKVDKLKII VILTILVIYV TMVYGPIAAM
LVEMFPTRIR YTSMSLPYHI GNGWFGGLLP TTAFAIVAQT GNMYNGLWYP IIIAGATFVI
GMLFIKETKD VDIYADD