Gene EcolC_4136 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_4136 
Symbol 
ID6066230 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp4562212 
End bp4563645 
Gene Length1434 bp 
Protein Length477 aa 
Translation table11 
GC content50% 
IMG OID641603557 
Productmajor facilitator transporter 
Protein accessionYP_001727060 
Protein GI170022106 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTCAAA TCACCAGTCC TGCAACTTAT TCAATTAGTC GGCCACAGGA CGTAATTGAT 
ATTGTTAATA AGAACTCTGC AATCAACACC AGTATCGGCG TTATTTTTAT TGCTTTGGGC
GGCATATTGA TTGATGCCTA CCAGGCGGCG ATGGTGGGGT TTGGTAATAA ATACATTGCC
GCTCAATTTG GCATTTCTCC AGGCCTTGCC GCAACCGTTA ATGCGTCAGT ATTAATCGCC
GCGTTAATTG GCGGTTTATT AGCGAACCGA GTAATAAACC GCTTTGGGCA AAAGCGAGCA
TTTATTATTG GCATGGGGCT GTGCACCATC GGGGCTGCTG CGGTAGCTAT TGCGCCCAGT
ATCTGGTGGG TGCTGGTGTG CCGCGTCATC ATGGGCTTTG GTTTAGGCAT CGACTTCCCT
TTGGCAACCA ATGCCGTGGC AGAGCTTCGT GGTTCAACGT CGAAGAAAAC CGGAACGTCG
GTCAACCTCT GGCAAATGGC CTGGTATGTT TCGACAACTG TTGTTTATTT GGTGCTCTTG
CCGCTGCTTC TGTCGGGTAT CGCTGAAGAA CAATTGTGGC GTTACGGAAT ATTCATCGGA
GCTATTTTTG CAGTCATCTT CATGATTTTG CGTTACTTCT TTATTGGTGA ATCCGCAATG
TGGGCCGCAC GCGTCGGGCG TTACCAGGAA GCGTGCGACA TTCTGGGAAA ACGTTATGGT
GTTCAGGCTC GCGTTGCGGC ATCGAGTACA ACAGAAGCGA AATTCTCGGA AAAAGCAGAG
AATAAATACA GTGGTGGATA TGGCATCTTA TTTAATGATC GTTACCGCAA ACGCACCATT
CTTGGCTGTG TCGTGGCAAC CATGCAGGCG TGGCAATATA ACGCCGTAGG TGTTTATCTT
CCTCTTACGT TGGCGGGAAT AATAAGTGGC GGGCTTACTG GTGCGTTGAC GGGTTCTGCC
GTCGTGAATG CCCTTTGTGG GGTGACAGGC GGGATGATCG GCTCGTTTAT TCTCCAACGA
CTGGGTACTC GCCGACAGTC GATGTATGGA TTTGCTGTTG TGACCTTAGC ATTGCTGTCG
TTAGGCGCAC TGGCAACGAC TAATCCATGG CTGTCTTTAG GGTTATTGGG ATCAATTATT
TTCTTCCATT CAGCGGGTCC TGGTGGGCTG GGCATGACCA TTGCCACACT CTCTTATCCT
CCCGCTATTC GCCCTACTGG GGTCGGATTT GCCCGCGCTA TTATGCGCAC AGGGGCAATT
GCAGGACTCA TTTTCTGGCC GATGCTGTGG GGGGCGTTGA AAACTGAAGC GTTTTACTGG
TTGGCAATCG TGCCATTCTT GGGGTTCCTG ACCTGCGTAT TGATTAATTG GGAACCACTG
GGTGCAAATG TTGATGCTGA GGATGCAGAG GTTCTGGCTG AATTGAAGAA ATAA
 
Protein sequence
MSQITSPATY SISRPQDVID IVNKNSAINT SIGVIFIALG GILIDAYQAA MVGFGNKYIA 
AQFGISPGLA ATVNASVLIA ALIGGLLANR VINRFGQKRA FIIGMGLCTI GAAAVAIAPS
IWWVLVCRVI MGFGLGIDFP LATNAVAELR GSTSKKTGTS VNLWQMAWYV STTVVYLVLL
PLLLSGIAEE QLWRYGIFIG AIFAVIFMIL RYFFIGESAM WAARVGRYQE ACDILGKRYG
VQARVAASST TEAKFSEKAE NKYSGGYGIL FNDRYRKRTI LGCVVATMQA WQYNAVGVYL
PLTLAGIISG GLTGALTGSA VVNALCGVTG GMIGSFILQR LGTRRQSMYG FAVVTLALLS
LGALATTNPW LSLGLLGSII FFHSAGPGGL GMTIATLSYP PAIRPTGVGF ARAIMRTGAI
AGLIFWPMLW GALKTEAFYW LAIVPFLGFL TCVLINWEPL GANVDAEDAE VLAELKK