Gene EcolC_3648 details


Gene Information

Locus tag: EcolC_3648
Symbol:
ID: 6065647
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 3994342
End bp: 3995772
Gene Length: 1431 bp
Protein Length: 476 aa
Translation table: 11
GC content: 54%
IMG OID: 641603063
Product: amino acid carrier protein
Protein accession: YP_001726586
Protein GI: 170021632
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1115] Na+/alanine symporter
TIGRFAM ID: [TIGR00835] amino acid carrier protein
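
The start and end coordinates, the gene length, and the protein length in this record can be cross-checked with simple arithmetic. The sketch below assumes that the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given inclusive, 1-based coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(3994342, 3995772)
print(length)                  # 1431
print(protein_length(length))  # 476
```

Both values agree with the Gene Length and Protein Length fields of the record.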


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCAGATT TTTTCTCCTT TATTAACAGC GTCCTTTGGG GATCGGTAAT GATTTACCTG 
CTCTTCGGCG CAGGTTGTTG GTTCACTTTT CGCACCGGAT TTGTGCAGTT TCGCTACATC
CGCCAGTTTG GCAAAAGTCT TAAAAATAGC ATTCATCCAC AGCCAGGTGG TTTAACCTCA
TTTCAGTCAT TGTGTACCAG TCTTGCGGCG CGCGTGGGTA GCGGCAACCT GGCCGGCGTT
GCGCTGGCAA TTACCGCCGG TGGACCTGGA GCCGTCTTCT GGATGTGGGT TGCCGCGTTT
ATCGGCATGG CGACCTCGTT TGCCGAATGT TCCCTCGCAC AGCTTTATAA AGAACGTGAT
GCCAATGGGC AGTTTCGTGG CGGACCGGCA TGGTATATGG CGCGCGGGCT GGGGATGCGC
TGGATGGGCG TTCTGTTCGC CGTCTTTTTG CTCATCGCCT ACGGCATAAT TTTCAGCGGA
ATTCAGGCGA ACGCCGTTGC GCGAGCCCTG AGTTTTTCTT TTGATTTTCC CCCGCTGGTG
ACAGGTATTA TTCTCGCTGT CTTTGCTCTG CTGGCGATCA CTCGCGGTCT TCATGGCGTC
GCCCGGCTCA TGCAGGGGTT TGTCCCGTTG ATGGCGATAA TCTGGGTACT GACCAGCCTG
GTGATTTGCG TAATGAATAT CGGGCAACTT CCCCACGTCA TTTGGTCTAT TTTTGAGAGT
GCTTTTGGCT GGCAGGAAGC GGCAGGCGGC GCGGCGGGAT ATACCTTAAG CCAGGCGATT
ACTAACGGTT TTCAGCGCAG TATGTTTTCC AATGAGGCGG GAATGGGGTC GACGCCAAAC
GCGGCAGCGG CAGCGGCGTC CTGGCCTCCG CATCCGGCAG CGCAAGGGAT TGTCCAGATG
ATTGGCATTT TTATCGACAC CCTGGTCATC TGTACGGCAA GCGCCATGCT GATATTACTG
GCGGGTAACG GCACAACCTA CATGCCGCTG GAAGGTATTC AGCTTATCCA GAAGGCGATG
CGGGTGCTAA TGGGTTCCTG GGGTGCTGAG TTTGTTACCC TCGTGGTTAT TCTGTTTGCC
TTCAGCTCCA TCGTTGCCAA CTACATTTAC GCCGAAAACA ATCTCTTCTT TTTACGCCTG
AACAACCCTA AAGCGATCTG GTGTTTGCGG ATCTGCACCT TCGCAACGGT CATCGGCGGC
ACCTTGCTAA GTCTTCCGCT GATGTGGCAA CTGGCAGATA TCATAATGGC CTGCATGGCT
ATTACCAATT TGACCGCCAT TTTACTGCTC TCGCCTGTGG TTCATACCAT TGCCAGTGAT
TATCTACGCC AGCGTAAACT CGGCGTGCGC CCGGTGTTTG ATCCGTTGCG TTATCCGGAT
ATCGGCCGCC AGCTTTCTCC GGACGCGTGG GATGATGTTT CGCAGGAGTA A
 
Protein sequence
MPDFFSFINS VLWGSVMIYL LFGAGCWFTF RTGFVQFRYI RQFGKSLKNS IHPQPGGLTS 
FQSLCTSLAA RVGSGNLAGV ALAITAGGPG AVFWMWVAAF IGMATSFAEC SLAQLYKERD
ANGQFRGGPA WYMARGLGMR WMGVLFAVFL LIAYGIIFSG IQANAVARAL SFSFDFPPLV
TGIILAVFAL LAITRGLHGV ARLMQGFVPL MAIIWVLTSL VICVMNIGQL PHVIWSIFES
AFGWQEAAGG AAGYTLSQAI TNGFQRSMFS NEAGMGSTPN AAAAAASWPP HPAAQGIVQM
IGIFIDTLVI CTASAMLILL AGNGTTYMPL EGIQLIQKAM RVLMGSWGAE FVTLVVILFA
FSSIVANYIY AENNLFFLRL NNPKAIWCLR ICTFATVIGG TLLSLPLMWQ LADIIMACMA
ITNLTAILLL SPVVHTIASD YLRQRKLGVR PVFDPLRYPD IGRQLSPDAW DDVSQE
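
The sequences above are printed in groups of 10 characters, so they need to be cleaned before any analysis. The following is a minimal sketch of how the gene sequence might be cleaned, translated, and checked for GC content. It assumes the codon-to-residue assignments of translation table 11, which are the same as in the standard genetic code (table 11 differs in its permitted start codons); the helper names `clean_sequence`, `gc_percent`, and `translate` are hypothetical.

```python
from itertools import product

# Codon assignments in TCAG order; shared by the standard code and table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): residue
    for codon, residue in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(block: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(block.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a cleaned DNA string."""
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First row of the gene sequence as printed in this record.
first_row = "ATGCCAGATT TTTTCTCCTT TATTAACAGC GTCCTTTGGG GATCGGTAAT GATTTACCTG"
dna = clean_sequence(first_row)
print(len(dna))        # 60
print(translate(dna))  # MPDFFSFINSVLWGSVMIYL
```

The translated fragment matches the first 20 residues of the protein sequence. Passing the complete gene sequence to the same functions would allow the reported protein sequence and GC content to be checked in the same way.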