Gene EcSMS35_2471 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2471 
SymbolfolC 
ID6145733 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2518539 
End bp2519807 
Gene Length1269 bp 
Protein Length422 aa 
Translation table11 
GC content56% 
IMG OID641617343 
Productbifunctional folylpolyglutamate synthase/ dihydrofolate synthase 
Protein accessionYP_001744515 
Protein GI170682976 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0285] Folylpolyglutamate synthase 
TIGRFAM ID[TIGR01499] folylpolyglutamate synthase/dihydrofolate synthase 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones56 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTATCA AACGCACTCC TCAAGCCGCG TCGCCTCTGG CTTCGTGGCT TTCTTATCTG 
GAAAACCTGC ACAGTAAAAC TATCGATCTC GGCCTTGAGC GCGTGAGCCA GGTCGCGGCG
CGTCTTGGCG TTCTGAAACC AGCGCCATTT GTGTTTACCG TTGCGGGTAC GAATGGCAAA
GGCACCACTT GCCGTACGCT GGAGTCGATT CTGATGGCGG CAGGGTACAA AGTGGGCGTC
TACAGTTCGC CGCATCTGGT GCGTTATACC GAGCGCGTAC GCGTGCAGGG CCAGGAATTG
CCGGAATCGG CCCACACCGC CTCTTTTGCG GAGATTGAAT CGGCACGCGG TGATATTTCC
TTGACCTATT TCGAGTACGG TACGCTGTCG GCGTTGTGGC TGTTCAAACA GGCACAACTT
GACGTGGTGA TTCTGGAAGT AGGGCTGGGC GGTCGTCTGG ACGCAACCAA TATTGTCGAT
GCCGATGTCG CGGTAGTAAC CAGCATTGCG CTGGATCATA CCGACTGGCT GGGGCCAGAT
CGCGAAAGTA TTGGTCGCGA GAAAGCAGGC ATCTTCCGCA GCGAAAAACC GGCAATTGTC
GGTGAGCCGG AAATGCCTTC TACCATTGCT GATGTGGCGC AGGAAAAAGG TGCAATGCTA
CAACGTCGGG GCGTTGAGTG GAACTATTCC GTCACCGATC ATGACTGGAC GTTTAGCGAT
GCTCACGGCA CGCTGGAAAA TCTGCCGTTG CCGCTTGTCC CGCAACCGAA TGCCGCAACG
GCGCTGGCGG CACTGCGTGC CAGCGGGCTG GAGGTCAGTG AAAATGCCAT TCGCGACGGG
ATTGCCAGCG CAATTTTGCC AGGACGTTTC CAGATTGTGA GCGAGTCGCC ACGCGTTATT
TTTGATGTCG CGCATAATCC ACATGCGGCG GAATATCTCA CCGGGCGTAT GAAAGCGCTA
CCGAAAAACG GGCGCGTGCT GGCGGTTATC GGTATGCTAC ATGATAAAGA TATTGCCGGA
ACTCTGGCCT GGTTGAAAAG CGTGGTTGAT GACTGGTATT GTGCGCCCCT GGAAGGGCCG
CGCGGTGCCA CGGCAGAACA ACTGCTTGAG CATTTGGGTA ACGGCAAATC ATTTGATAGC
GTGGCGCAGG CATGGGATGC CGCAATGGCG GACGCTAAAG CGGAAGATAC CGTGCTGGTG
TGTGGTTCGT TCCACACGGT CGCACATGTC ATGGAAGTGA TTGACGCGAG GAGAAGCGGT
GGCAAGTAA
 
Protein sequence
MIIKRTPQAA SPLASWLSYL ENLHSKTIDL GLERVSQVAA RLGVLKPAPF VFTVAGTNGK 
GTTCRTLESI LMAAGYKVGV YSSPHLVRYT ERVRVQGQEL PESAHTASFA EIESARGDIS
LTYFEYGTLS ALWLFKQAQL DVVILEVGLG GRLDATNIVD ADVAVVTSIA LDHTDWLGPD
RESIGREKAG IFRSEKPAIV GEPEMPSTIA DVAQEKGAML QRRGVEWNYS VTDHDWTFSD
AHGTLENLPL PLVPQPNAAT ALAALRASGL EVSENAIRDG IASAILPGRF QIVSESPRVI
FDVAHNPHAA EYLTGRMKAL PKNGRVLAVI GMLHDKDIAG TLAWLKSVVD DWYCAPLEGP
RGATAEQLLE HLGNGKSFDS VAQAWDAAMA DAKAEDTVLV CGSFHTVAHV MEVIDARRSG
GK