Gene EcolC_1090 details

Gene Information

Locus tag: EcolC_1090
Symbol:
ID: 6066567
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 1187176
End bp: 1188474
Gene Length: 1299 bp
Protein Length: 432 aa
Translation table: 11
GC content: 49%
IMG OID: 641600506
Product: alpha-ketoglutarate transporter
Protein accession: YP_001724084
Protein GI: 170019130
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID: [TIGR00883] metabolite-proton symporter
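
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. Neither convention is stated in the record itself. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 1187176
end_bp = 1188474

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1299
print(protein_length)  # 432
```

Under these assumptions the computed values agree with the reported Gene Length (1299 bp) and Protein Length (432 aa).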


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.153199
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00390378
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGCTGAAA GTACTGTAAC GGCAGACAGC AAACTGACAA GTAGTGATAC TCGTCGCCGC 
ATTTGGGCGA TTGTGGGGGC CTCTTCAGGT AATCTGGTCG AGTGGTTCGA TTTCTATGTC
TACTCGTTCT GTTCACTCTA CTTCGCCCAC ATTTTCTTCC CTTCCGGGAA CACGACGACT
CAACTACTAC AAACAGCAGG TGTTTTTGCT GCGGGATTCC TGATGCGCCC AATAGGCGGT
TGGCTATTTG GCCGCATAGC CGATAAACAT GGTCGCAAAA AATCGATGCT GTTATCGGTG
TGTATGATGT GTTTCGGATC GCTGGTTATC GCCTGCCTCC CAGGTTATGA AACTATAGGT
ACGTGGGCTC CGGCATTATT GCTTCTCGCT CGTTTATTTC AGGGATTATC CGTTGGCGGA
GAATATGGCA CCAGCGCCAC CTATATGAGT GAAGTTGCCG TTGAAGGGCG CAAAGGTTTT
TACGCATCAT TTCAGTATGT GACGTTGATC GGCGGACAAC TGCTAGCCCT ACTGGTTGTC
GTGGTTTTAC AACACACCAT GGAAGACGCT GCACTCAGAG AGTGGGGATG GCGTATTCCT
TTCGCGTTAG GAGCTGTGTT AGCTGTTGTG GCGTTGTGGT TACGTCGTCA GTTAGATGAA
ACTTCGCAAC AAGAAACGCG CGCTTTAAAA GAAGCTGGAT CTCTGAAAGG ATTATGGCGC
AATCGCCGTG CATTCATCAT GGTTCTCGGT TTTACCGCTG CGGGCTCCCT TTGTTTCTAT
ACCTTCACTA CTTATATGCA GAAGTATCTG GTAAATACTG CGGGAATGCA TGCCAACGTG
GCGAGTGGCA TTATGACTGC CGCATTGTTT GTATTCATGC TTATTCAACC ACTCATTGGC
GCGCTGTCGG ATAAGATTGG TCGCCGTACC TCAATGTTAT GTTTCGGTTC GCTGGCAGCC
ATTTTTACCG TTCCTATTCT CTCAGCATTG CAAAACGTTT CCTCGCCTTA TGCCGCTTTT
GGTCTGGTGA TGTGTGCCCT GCTGATAGTG AGTTTTTATA CATCAATCAG TGGAATACTG
AAGGCTGAGA TGTTCCCGGC ACAGGTTCGC GCATTAGGCG TTGGTCTGTC ATATGCGGTC
GCTAATGCTA TATTTGGTGG TTCGGCGGAG TACGTAGCGT TGTCGCTGAA ATCAATAGGA
ATGGAAACAG CCTTCTTCTG GTATGTGACC TTGATGGCCG TGGTGGCGTT TCTGGTTTCT
TTGATGCTAC ATCGCAAAGG GAAGGGGATG CGTCTTTAG
 
Protein sequence
MAESTVTADS KLTSSDTRRR IWAIVGASSG NLVEWFDFYV YSFCSLYFAH IFFPSGNTTT 
QLLQTAGVFA AGFLMRPIGG WLFGRIADKH GRKKSMLLSV CMMCFGSLVI ACLPGYETIG
TWAPALLLLA RLFQGLSVGG EYGTSATYMS EVAVEGRKGF YASFQYVTLI GGQLLALLVV
VVLQHTMEDA ALREWGWRIP FALGAVLAVV ALWLRRQLDE TSQQETRALK EAGSLKGLWR
NRRAFIMVLG FTAAGSLCFY TFTTYMQKYL VNTAGMHANV ASGIMTAALF VFMLIQPLIG
ALSDKIGRRT SMLCFGSLAA IFTVPILSAL QNVSSPYAAF GLVMCALLIV SFYTSISGIL
KAEMFPAQVR ALGVGLSYAV ANAIFGGSAE YVALSLKSIG METAFFWYVT LMAVVAFLVS
LMLHRKGKGM RL
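
The listings above are printed in space-separated blocks of ten characters, so they need to be cleaned before any computation. The following is a minimal sketch of how the gene sequence could be cleaned, translated, and checked for GC content. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical and not part of any database tool. The codon table is built from the standard genetic code, whose amino-acid assignments translation table 11 shares (table 11 differs in its permitted start codons, which this sketch does not model).

```python
# Standard genetic code in NCBI order (first, second, third base over T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Return the fraction of G and C bases in a non-empty sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate complete codons in reading frame 1; '*' marks a stop codon."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First and last lines of the gene sequence listing above. Each full line holds
# 60 bases, so every line starts in frame.
first_line = "ATGGCTGAAA GTACTGTAAC GGCAGACAGC AAACTGACAA GTAGTGATAC TCGTCGCCGC"
last_line = "TTGATGCTAC ATCGCAAAGG GAAGGGGATG CGTCTTTAG"

print(translate(first_line))  # MAESTVTADSKLTSSDTRRR
print(translate(last_line))   # LMLHRKGKGMRL*
```

The translated fragments match the start and end of the protein sequence listed above, with the trailing `*` corresponding to the TAG stop codon. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content reported in the record.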