Gene EcHS_A2466 details


Gene Information

Locus tag: EcHS_A2466
Symbol: folC
ID: 5592415
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 2473245
End bp: 2474513
Gene Length: 1269 bp
Protein Length: 422 aa
Translation table: 11
GC content: 56%
IMG OID: 640921587
Product: bifunctional folylpolyglutamate synthase/dihydrofolate synthase
Protein accession: YP_001459121
Protein GI: 157161803
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0285] Folylpolyglutamate synthase
TIGRFAM ID: [TIGR01499] folylpolyglutamate synthase/dihydrofolate synthase
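
The Start bp, End bp, Gene Length and Protein Length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2473245
END_BP = 2474513


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1269
print(protein_length(length_bp))  # 422
```

Both results match the Gene Length (1269 bp) and Protein Length (422 aa) reported in this record.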


Plasmid Coverage information

Num covering plasmid clones: 70
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATTATCA AACGCACTCC TCAAGCCGCG TCGCCTCTGG CTTCGTGGCT TTCTTATCTG 
GAAAACCTGC ACAGTAAAAC TATCGATCTC GGCCTTGAGC GCGTGAGCCT GGTCGCGGCG
CGTCTTGGCG TCCTGAAACC AGCGCCATTT GTGTTTACCG TTGCGGGTAC GAATGGCAAA
GGCACCACCT GCCGTACGCT GGAGTCGATT CTGATGGCGG CAGGGTACAA AGTGGGCGTC
TACAGTTCGC CTCATCTGGT GCGTTATACC GAGCGCGTAC GTGTGCAGGG CCAGGAATTG
CCGGAATCGG CCCACACCGC CTCTTTTGCG GAGATTGAAT CGGCACGCGG TGATATTTCC
CTGACCTATT TCGAGTACGG TACGCTGTCG GCGTTGTGGC TGTTCAAGCA GGCACAACTT
GACGTGGTGA TTCTGGAAGT AGGGCTGGGC GGTCGTCTGG ACGCAACCAA TATTGTCGAC
GCCGATGTCG CGGTAGTAAC CAGTATTGCG CTGGATCATA CCGACTGGCT GGGTCCAGAT
CGCGAAAGTA TTGGTCGCGA GAAAGCAGGC ATCTTCCGCA GCGAAAAACC GGCAATTGTC
GGTGAGCCGG AAATGCCTTC TACCATTGCT GATGTGGCGC AGGAAAAAGG TGCACTGTTA
CAACGTCGGG GCGTTGAGTG GAACTATTCC GTCACCGATC ATGACTGGGC GTTTAGCGAT
GCTCACGGCA CGCTGGAAAA TCTGCCGTTG CCGCTTGTCC CGCAACCGAA TGCCGCAACA
GCGCTGGCGG CACTGCGTGC CAGCGGGCTG GAAGTCAGTG AAAATGCCAT TCGCGACGGG
ATTGCCAGCG CAATTTTGCC GGGACGTTTC CAGATTGTGA GCGAGTCGCC ACGCGTTATT
TTTGATGTCG CGCATAATCC ACATGCGGCG GAATATCTCA CCGGGCGTAT GAAAGCGCTA
CCGAAAAACG GGCGCGTGCT GGCGGTTATC GGTATGCTAC ATGATAAAGA TATTGCCGGA
ACTCTGGCCT GGTTGAAAAG CGTGGTTGAT GACTGGTATT GTGCGCCACT GGAAGGGCCG
CGCGGTGCCA CGGCAGAACA ACTGCTTGAG CATTTGGGTA ACGGCAAATC ATTTGATAGC
GTTGCGCAGG CATGGGATGC CGCAATGGCG GACGCTAAAG CGGAAGACAC CGTGCTGGTG
TGTGGTTCTT TCCACACGGT CGCACATGTC ATGGAAGTGA TTGACGCGAG GAGAAGCGGT
GGCAAGTAA
 
Protein sequence
MIIKRTPQAA SPLASWLSYL ENLHSKTIDL GLERVSLVAA RLGVLKPAPF VFTVAGTNGK 
GTTCRTLESI LMAAGYKVGV YSSPHLVRYT ERVRVQGQEL PESAHTASFA EIESARGDIS
LTYFEYGTLS ALWLFKQAQL DVVILEVGLG GRLDATNIVD ADVAVVTSIA LDHTDWLGPD
RESIGREKAG IFRSEKPAIV GEPEMPSTIA DVAQEKGALL QRRGVEWNYS VTDHDWAFSD
AHGTLENLPL PLVPQPNAAT ALAALRASGL EVSENAIRDG IASAILPGRF QIVSESPRVI
FDVAHNPHAA EYLTGRMKAL PKNGRVLAVI GMLHDKDIAG TLAWLKSVVD DWYCAPLEGP
RGATAEQLLE HLGNGKSFDS VAQAWDAAMA DAKAEDTVLV CGSFHTVAHV MEVIDARRSG
GK
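
The sequences above are laid out in space-separated blocks of ten characters, so they need to be joined before any length or composition check. The following is a minimal sketch of that cleanup, together with a simple GC-fraction calculation; the helper names are hypothetical, and the demonstration uses only the first three blocks of the gene sequence rather than the full record.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First three 10-base blocks of the gene sequence in this record.
block = "ATGATTATCA AACGCACTCC TCAAGCCGCG"
seq = clean_sequence(block)
print(len(seq))          # 30
print(gc_fraction(seq))  # 0.5
```

Applying the same two functions to the whole Gene sequence block would allow a comparison against the Gene Length and GC content fields reported in the Gene Information section.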