Gene TM1040_2675 details


Gene Information

Locus tag: TM1040_2675
Symbol:
ID: 4077586
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 2810634
End bp: 2811905
Gene Length: 1272 bp
Protein Length: 423 aa
Translation table: 11
GC content: 66%
IMG OID: 638007999
Product: FolC bifunctional protein
Protein accession: YP_614669
Protein GI: 99082515
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0285] Folylpolyglutamate synthase
TIGRFAM ID: [TIGR01499] folylpolyglutamate synthase/dihydrofolate synthase
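
The coordinate and length fields above can be cross-checked against each other. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming its length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon encodes no residue


length = gene_length_bp(2810634, 2811905)
print(length)                     # 1272
print(protein_length_aa(length))  # 423
```

Under these assumptions both results agree with the Gene Length and Protein Length values listed in the record.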


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGAAC AGACCTCCGA TGCCATTCTC GCCCGCATGA TGGCGCTGCA CCCCAAGATC 
ATCGACCTGA CGCTGGATCG GGTCTGGCGT CTTCTGGCGG CGCTTGAGAA CCCGCAGGAC
AAGCTGCCCC CGGTGATCCA TATCGCGGGC ACCAACGGCA AAGGCTCGAC GCAGGCGATG
ATCCGCGCCG GGCTTGAGGG CTGGGGCAAA TCGGTGCACG CCTATACCTC ACCGCATCTG
GCGCGCTTTC ATGAACGCAT CCGTCTGGCG GGCGATCTGA TCTCAGAGGC GCATCTCACG
GAGGTGCTGG ACGAGTGCTA CAGCGCCAAT GGCGACGCAA GCATCACCTA TTTCGAGATC
ACCACCGTCG CGGGGCTTCT GGCGTTTTCA CGCACACCCG CCGACTACAC CCTCCTCGAG
GTTGGCCTTG GCGGGCGTCT GGATGCCACC AATGTCATCA CGCCCGAGGT CTCTGTAATC
ACGCCGGTCT CGATCGATCA CGAGCAATTC CTCGGCAATA CGCTGGCCAA GATCGCAGGC
GAGAAGGCCG GGATCATCAA ACGCGGCGTG CCCGTGGTGG TCGGCCCCCA GGCCGAAGAG
GCGATGGAGG TCATCGAGGA CACCGCGATG CGCCTGGGCG CCCCCCTGAT CGCCTATGGC
CAGCACTGGC ACGTCCACGA GGAACGCGGC CGTCTCGTCT ATCAGGACGA GCGCGGCCTG
CTGGACCTGC CGCTGCCCAA CCTGATGGGC GCGCATCAGA TCCAGAACGC CGGCGCCGCC
CTTGCGGTGC TGCGCCACCT TGGCGCGGAT GAGGCCGCCT GCGAGGCGGC TGTTGTGGGG
GCACAGTGGC CCGCCCGTAT GCAGCGCCTC AAAACCGGCC CGCTGGTGGA AATCGCAGGC
GCGGCCGAGC TGTGGCTCGA TGGCGGCCAC AACGCCGCTG CCGGGATTGC TCTCGCCGAT
GTGCTGGCCA AGCTGCCCAA ACGCCCGACG CATCTGATTT GCGGTATGCT CAATACCAAG
GATGTCACAG GCTACCTGCG CCCGCTCGCC GCCGAGGCCC AGAGCCTCAC GGCGGTGTCG
ATCCCGGGCG AGGCGGCAAC CCTCTCGGCC GAGGAAACCA AAGCGGCGGC CAGTTCCGTG
GACCTTCCAG CAACCACTGC GGAGTCTGTT GCCGAGGCGC TCACCGCCAT TCTGGCGCGC
GACCCAGACA GCCGCGTGCT GATCTGCGGC TCGCTCTATC TGGCGGGCAA TATCCTGCGC
GAGAACGGCT AG
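
The gene sequence is printed in space-separated blocks of ten bases, so it has to be cleaned before any computation. The following is a minimal sketch of how the block format could be collapsed and how a GC percentage and terminal stop codon could be checked. The helper names are hypothetical, and the stop codons listed are those of translation table 11 (TAA, TAG, TGA). The short fragment used here is the last line of the sequence, not the full gene.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def ends_with_stop(seq: str, stops=("TAA", "TAG", "TGA")) -> bool:
    """True if the last three bases form a stop codon (table 11 stops)."""
    return seq[-3:] in stops


fragment = clean_sequence("GAGAACGGCT AG")  # last line of the gene sequence
print(len(fragment))             # 12
print(ends_with_stop(fragment))  # True
print(gc_percent("ATGC"))        # 50.0
```

Passing the complete gene sequence through clean_sequence and gc_percent would give a value that can be compared with the GC content field of the record.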
 
Protein sequence
MTEQTSDAIL ARMMALHPKI IDLTLDRVWR LLAALENPQD KLPPVIHIAG TNGKGSTQAM 
IRAGLEGWGK SVHAYTSPHL ARFHERIRLA GDLISEAHLT EVLDECYSAN GDASITYFEI
TTVAGLLAFS RTPADYTLLE VGLGGRLDAT NVITPEVSVI TPVSIDHEQF LGNTLAKIAG
EKAGIIKRGV PVVVGPQAEE AMEVIEDTAM RLGAPLIAYG QHWHVHEERG RLVYQDERGL
LDLPLPNLMG AHQIQNAGAA LAVLRHLGAD EAACEAAVVG AQWPARMQRL KTGPLVEIAG
AAELWLDGGH NAAAGIALAD VLAKLPKRPT HLICGMLNTK DVTGYLRPLA AEAQSLTAVS
IPGEAATLSA EETKAAASSV DLPATTAESV AEALTAILAR DPDSRVLICG SLYLAGNILR
ENG