Gene EcSMS35_4685 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4685 
SymbolcycA 
ID6146330 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4782999 
End bp4784402 
Gene Length1404 bp 
Protein Length467 aa 
Translation table11 
GC content51% 
IMG OID641619501 
ProductD-alanine/D-serine/glycine permease 
Protein accessionYP_001746609 
Protein GI170682794 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1113] Gamma-aminobutyrate permease and related permeases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000407821 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones61 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTAGATC AGGTAAAAGT CGTTGCCGAT GATCAGGCTC CGGCTGAACA GTCGCTACGG 
CGCAATCTCA CAAACCGACA TATTCAGCTT ATTGCCATTG GCGGTGCCAT TGGTACAGGG
TTGTTTATGG GGTCCGGCAA AACGATTAGC CTTGCCGGGC CGTCGATCAT TTTCGTTTAT
ATGATCATCG GTTTTATGCT CTTTTTCGTG ATGCGGGCAA TGGGGGAATT GCTGCTTTCG
AATCTGGAAT ACAAATCTTT TAGTGACTTC GCTTCCGATT TACTCGGGCC GTGGGCAGGA
TATTTCACCG GCTGGACTTA CTGGTTCTGC TGGGTTGTAA CCGGTATGGC AGACGTGGTT
GCCATTACCG CCTATGCGCA ATTCTGGTTC CCTGGGCTTT CTGACTGGGT TGCTTCGTTA
TCCGTGATCA TTCTGTTACT GGTTCTAAAC CTCGCCACGG TAAAAATGTT CGGTGAGATG
GAGTTCTGGT TTGCGATGAT CAAAATCGTC GCCATCGTGT CGCTGATTGT TGTCGGCCTG
GTCATGGTGG CGATGCACTT TCAGTCACCG ACCGGTGTGG AAGCATCATT TGCACATTTG
TGGAATGACG GCGGCTGGTT CCCGAAAGGC TTAAGTGGCT TCTTTGCTGG ATTCCAGATA
GCGGTTTTCG CTTTCGTAGG GATTGAGCTG GTAGGTACCA CCGCTGCGGA AACCAAAGAT
CCAGAGAAAT CACTGCCACG CGCGATTAAC TCCATTCCGA TCCGTATCAT TATGTTCTAC
GTCTTCTCGC TGATTGTGAT TATGTCCGTG ACGCCGTGGA GTTCGGTAGT CCCGGAGAAA
AGCCCGTTCG TTGAACTGTT TGTGTTGGTA GGTTTGCCTG CGGCTGCCAG CGTGATCAAC
TTTGTGGTGC TGACCTCTGC GGCGTCTTCC GCTAACAGCG GTGTCTTCTC TACCAGCCGT
ATGCTGTTTG GTCTGGCCCA GGAAGGTGTG GCACCGAAAG CGTTCGCTAA ACTCTCTAAG
CGCGCAGTAC CCGCGAAAGG GCTGACCTTC TCTTGTATCT GTCTGCTCGG CGGCGTGGTG
ATGTTGTATG TGAATCCCAG CGTGATTGGC GCGTTCACGA TGATTACAAC CGTTTCCGCG
ATTCTGTTTA TGTTTGTCTG GACGATTATC CTTTGCTCGT ACCTGGTGTA TCGCAAACAG
CGTCCTCATC TGCATGAGAA GTCGATCTAC AAGATGCCAC TCGGCAAGCT GATGTGCTGG
GTATGTATGG CGTTCTTTGT GTTTGTTCTG GTGTTGTTGA CACTGGAAGA TGACACCCGC
CAGGCGCTGC TGGTTACCCC GCTGTGGTTT ATCGCGCTGG GGCTGGGCTG GCTGTTTATT
GGTAAAAAAC GCATGGCGAA GTAA
 
Protein sequence
MVDQVKVVAD DQAPAEQSLR RNLTNRHIQL IAIGGAIGTG LFMGSGKTIS LAGPSIIFVY 
MIIGFMLFFV MRAMGELLLS NLEYKSFSDF ASDLLGPWAG YFTGWTYWFC WVVTGMADVV
AITAYAQFWF PGLSDWVASL SVIILLLVLN LATVKMFGEM EFWFAMIKIV AIVSLIVVGL
VMVAMHFQSP TGVEASFAHL WNDGGWFPKG LSGFFAGFQI AVFAFVGIEL VGTTAAETKD
PEKSLPRAIN SIPIRIIMFY VFSLIVIMSV TPWSSVVPEK SPFVELFVLV GLPAAASVIN
FVVLTSAASS ANSGVFSTSR MLFGLAQEGV APKAFAKLSK RAVPAKGLTF SCICLLGGVV
MLYVNPSVIG AFTMITTVSA ILFMFVWTII LCSYLVYRKQ RPHLHEKSIY KMPLGKLMCW
VCMAFFVFVL VLLTLEDDTR QALLVTPLWF IALGLGWLFI GKKRMAK