Gene EcolC_1337 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_1337 
Symbol 
ID6068235 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp1467981 
End bp1469249 
Gene Length1269 bp 
Protein Length422 aa 
Translation table11 
GC content56% 
IMG OID641600759 
Productbifunctional folylpolyglutamate synthase/ dihydrofolate synthase 
Protein accessionYP_001724330 
Protein GI170019376 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0285] Folylpolyglutamate synthase 
TIGRFAM ID[TIGR01499] folylpolyglutamate synthase/dihydrofolate synthase 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.887259 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTATCA AACGCACTCC TCAAGCCGCG TCGCCTCTGG CTTCGTGGCT TTCTTATCTG 
GAAAACCTGC ACAGTAAAAC TATCGATCTC GGCCTTGAGC GCGTGAGCCA GGTCGCGGCG
CGTCTTGGCG TCCTGAAACC AGCGCCATTT GTGTTTACCG TTGCGGGTAC GAATGGCAAA
GGCACCACCT GCCGTACGCT GGAGTCGATT CTGATGGCGG CAGGGTACAA AGTGGGCGTC
TACAGTTCAC CGCATCTGGT GCGTTATACC GAGCGCGTAC GTGTGCAGGG GCAGGAATTG
CCGGAATCGG CCCACACCGC CTCTTTTGCG GAGATTGAAT CGGCACGCGG TGATATTTCC
CTGACCTATT TCGAGTACGG TACGCTGTCG GCGTTGTGGC TGTTCAAACA GGCACAACTT
GACGTAGTGA TTCTGGAAGT AGGGCTGGGC GGTCGTCTGG ACGCAACCAA TATTGTCGAT
GCCGATGTGG CTGTAGTAAC CAGCATTGCG CTGGATCATA CCGACTGGCT GGGACCAGAT
CGCGAAAGTA TTGGTCGCGA GAAAGCAGGC ATCTTTCGCA GCGCAAAACC GGCAATTGTC
GGTGAGCCGG AAATGCCTTC TACCATTGCT GATGTGGCGC AGGAAAAAGG TGCACTGTTA
CAACGTCGGG GCGTTGAGTG GAACTATTCC GTCACCGATC ATGACTGGGC GTTTAGCGAT
GCTCACGGCA CGCTGGAAAA TCTGCCGTTG CCGCTTGTCC CGCAACCGAA TGCCGCAACA
GCGCTGGCGG CACTGCGTGC CAGCGGGCTG GAAGTCAGTG AAAATGCCAT TCGCGACGGG
ATTGCCAGCG CAATTTTGCC GGGACGTTTC CAGATTGTGA GCGAGTCGCC ACGCGTTATT
TTTGATGTCG CGCATAATCC ACATGCGGCG GAATATCTCA CCGGGCGTAT GAAAGCGCTA
CCGAAAAACG GGCGCGTGCT GGCGGTTATC GGTATGCTAC ATGATAAAGA TATTGCCGGA
ACTCTGGCCT GGTTGAAAAG CGTGGTTGAT GACTGGTATT GTGCGCCACT GGAAGGGCCG
CGCGGTGCCA CGGCAGAACA ACTGCTTGAG CATTTGGGTA ACGGCAAATC ATTTGATAGC
GTTGCGCAGG CATGGGATGC CGCAATGGCG GACGCTAAAG CGGAAGACAC CGTGCTGGTG
TGTGGTTCTT TCCACACGGT CGCACATGTC ATGGAAGTGA TTGACGCGAG GAGAAGCGGT
GGCAAGTAA
 
Protein sequence
MIIKRTPQAA SPLASWLSYL ENLHSKTIDL GLERVSQVAA RLGVLKPAPF VFTVAGTNGK 
GTTCRTLESI LMAAGYKVGV YSSPHLVRYT ERVRVQGQEL PESAHTASFA EIESARGDIS
LTYFEYGTLS ALWLFKQAQL DVVILEVGLG GRLDATNIVD ADVAVVTSIA LDHTDWLGPD
RESIGREKAG IFRSAKPAIV GEPEMPSTIA DVAQEKGALL QRRGVEWNYS VTDHDWAFSD
AHGTLENLPL PLVPQPNAAT ALAALRASGL EVSENAIRDG IASAILPGRF QIVSESPRVI
FDVAHNPHAA EYLTGRMKAL PKNGRVLAVI GMLHDKDIAG TLAWLKSVVD DWYCAPLEGP
RGATAEQLLE HLGNGKSFDS VAQAWDAAMA DAKAEDTVLV CGSFHTVAHV MEVIDARRSG
GK