Gene EcolC_3798 details

Gene Information

Locus tag: EcolC_3798
Symbol:
ID: 6067262
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 4157442
End bp: 4158854
Gene Length: 1413 bp
Protein Length: 470 aa
Translation table: 11
GC content: 53%
IMG OID: 641603211
Product: D-alanine/D-serine/glycine permease
Protein accession: YP_001726730
Protein GI: 170021776
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1113] Gamma-aminobutyrate permease and related permeases
TIGRFAM ID:
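
The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below is illustrative only and not part of the database record. It assumes the start and end positions are 1-based and inclusive, and that the protein length excludes a single terminating stop codon. The function names `span_length` and `coding_codons` are hypothetical helpers introduced for this example.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 4157442
END_BP = 4158854
GENE_LENGTH_BP = 1413
PROTEIN_LENGTH_AA = 470


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Amino-acid count, assuming exactly one trailing stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


computed_length = span_length(START_BP, END_BP)
computed_protein = coding_codons(computed_length)

# Both comparisons hold for the values listed in this record.
print(computed_length == GENE_LENGTH_BP)
print(computed_protein == PROTEIN_LENGTH_AA)
```

Under these assumptions the span 4157442 to 4158854 gives 1413 bp, which is 471 codons, that is, 470 amino acids plus the stop codon.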


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000560112
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.948087
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTAGATC AGGTAAAAGT CGTTGCCGAT GATCAGGCTC CGGCTGAACA GTCGCTACGG 
CGCAATCTCA CAAACCGACA TATTCAGCTT ATTGCCATTG GCGGTGCCAT TGGTACGGGG
TTGTTTATGG GGTCTGGCAA AACGATTAGC CTTGCCGGGC CGTCGATCAT TTTCGTTTAT
ATGATCATCG GTTTTATGCT CTTTTTCGTG ATGCGGGCAA TGGGGGAATT GCTGCTTTCG
AATCTGGAAT ACAAATCTTT TAGTGACTTC GCTTCCGATT TACTCGGGCC GTGGGCAGGA
TATTTCACCG GCTGGACTTA CTGGTTCTGC TGGGTTGTAA CCGGTATGGC AGACGTGGTG
GCGATCACTG CTTATGCTCA GTTCTGGTTC CCCGATCTCT CCGACTGGGT CGCCTCGCTG
GCGGTGATAG TGCTGCTGTT GACGCTCAAC CTTGCCACCG TGAAAATGTT CGGTGAGATG
GAGTTCTGGT TTGCGATGAT CAAAATCGTC GCCATCGTGT CGCTGATTGT CGTCGGCCTG
GTCATGGTGG CGATGCACTT TCAGTCACCG ACCGGTGTGG AAGCGTCATT CGCGCATTTG
TGGAATGACG GCGGCTGGTT CCCGAAAGGT TTAAGTGGCT TCTTTGCCGG ATTCCAGATA
GCGGTTTTCG CTTTCGTGGG GATTGAGCTG GTAGGTACAA CAGCTGCGGA AACCAAAGAT
CCGGAGAAAT CACTGCCACG CGCGATTAAC TCCATTCCGA TCCGTATCAT TATGTTCTAC
GTCTTCGCGC TGATTGTGAT TATGTCCGTG ACGCCGTGGA GTTCGGTAGT CCCGGAGAAA
AGCCCGTTTG TTGAACTGTT CGTGTTGGTA GGGCTGCCTG CTGCCGCAAG CGTGATCAAC
TTTGTGGTGC TGACCTCTGC GGCGTCTTCC GCTAACAGCG GCGTCTTCTC TACCAGCCGT
ATGCTGTTTG GTCTGGCGCA GGAAGGTGTG GCACCGAAAG CGTTCGCTAA ACTTTCTAAG
CGCGCAGTAC CAGCGAAAGG GCTGACCTTC TCCTGCATCT GCCTGCTCGG TGGCGTGGTG
ATGCTGTATG TGAATCCTAG CGTGATTGGC GCGTTCACGA TGATTACAAC CGTTTCCGCG
ATTCTGTTTA TGTTCGTCTG GACGATTATC CTTTGCTCGT ACCTGGTTTA CCGCAAACAG
CGTCCTCATC TGCATGAGAA GTCGATCTAC AAGATGCCGC TCGGCAAGCT GATGTGCTGG
GTATGTATGG CGTTCTTTGT GTTCGTGGTC GTGTTGCTGA CACTGGAAGA TGACACTCGC
CAGGCGCTGT TGGTCACCCC GCTGTGGTTT ATCGCACTGG GGCTGGGCTG GTTGTTTATT
GGTAAGAAGC GGGCTGCTGA ACTGCGGAAA TAA
 
Protein sequence
MVDQVKVVAD DQAPAEQSLR RNLTNRHIQL IAIGGAIGTG LFMGSGKTIS LAGPSIIFVY 
MIIGFMLFFV MRAMGELLLS NLEYKSFSDF ASDLLGPWAG YFTGWTYWFC WVVTGMADVV
AITAYAQFWF PDLSDWVASL AVIVLLLTLN LATVKMFGEM EFWFAMIKIV AIVSLIVVGL
VMVAMHFQSP TGVEASFAHL WNDGGWFPKG LSGFFAGFQI AVFAFVGIEL VGTTAAETKD
PEKSLPRAIN SIPIRIIMFY VFALIVIMSV TPWSSVVPEK SPFVELFVLV GLPAAASVIN
FVVLTSAASS ANSGVFSTSR MLFGLAQEGV APKAFAKLSK RAVPAKGLTF SCICLLGGVV
MLYVNPSVIG AFTMITTVSA ILFMFVWTII LCSYLVYRKQ RPHLHEKSIY KMPLGKLMCW
VCMAFFVFVV VLLTLEDDTR QALLVTPLWF IALGLGWLFI GKKRAAELRK