Gene Cagg_0024 details


Gene Information

Locus tag: Cagg_0024
Symbol:
ID: 7269021
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 38706
End bp: 40424
Gene Length: 1719 bp
Protein Length: 572 aa
Translation table: 11
GC content: 58%
IMG OID: 643564897
Product: Formate--tetrahydrofolate ligase
Protein accession: YP_002461413
Protein GI: 219846980
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG2759] Formyltetrahydrofolate synthetase
TIGRFAM ID:
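
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the start and end positions are 1-based and inclusive and that the CDS ends with one stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(38706, 40424)
print(length, protein_length_aa(length))
```

With the values from this record, the sketch gives 1719 bp and 572 aa, which agrees with the Gene Length and Protein Length fields.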


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAACCA GCCTGCAAAT CGCTGCCGAA GCCCACCTCG AACCGATCGG TGTCATCGCC 
GAACGACTCG GTCTGCCCAC CGAGTATCTC GAACCGTATG GGCGCTACCG CGGCAAGATC
GATCTTACGT TTCTCGACGA TCACGCCAAT CGTCCACGTG GTCGCTATAT TCTGGTTAGC
GCCATTACGC CGACTCCGCT TGGTGAAGGG AAAACAACCA CTGCCATTGG GTTGGCGATG
GCTCTCAACC GGATCGGTAA ACGCGCCGTC GTTACGTTAC GTCAATCATC ACTAGGACCG
GTGTTTGGCA TTAAAGGCGG TGGCGCGGGT GGTGGCTATA GTCAAATCGT CCCATTAGCC
GAGAGCATAC TGCATCTCAA CGGTGATATT CACGCCGTCT CACAAGCGCA CAACCAGTTG
GCGGCCCTTA CCGACAACAG TTGGTATCAC GGCAACCCTC TCGACATCGA TCCCGATCGG
ATCGAGATTC GGCGTGTTGT GGATGTCAAC GATCGCTTTT TGCGACAGGT CATGATCGGG
TTAGGTGGCA AGCAAAACGG TTTTCCGCGC CAGACCGGCT TCGACATTAG CGTCGCCAGT
GAACTCATGG CGATCCTCGC GATGGTCAGT GGAGCAGGCG CAAAAGCCGC GCTCCGCGAG
CTGCGCGCCC GTATCGGCCG GATGGTGGTG GCCTTTCGCC GCGACGGTAC ACCGGTAACA
GCCGAGGATG TACGTGGGGC GGGTGCAGCA ACGGTGCTGA TGCGCGAGGC GCTGAAGCCC
AACCTGATGC AGACCATCGA AAACACACCG GCCTTGATCC ACGCCGGACC CTTTGCCAAC
ATCGCACAGG GCAACTCTTC CATTCTGGCC GATCTCGTGG CGCTCCGTTG TGCCGAGTAT
ACCATTACCG AGGCCGGTTT CGGCGCCGAT ATCGGAGCAG AAAAATTCTT CAATCTCAAA
TGCCGAGCCG GTGGTCTCTG GCCTGACGCT GCGGTCATTG TAGCCACCGT TCGCGCGCTC
AAAGCTCACA GCGGCAAGTA TGAGATCGTC GCCGGCAAAC CACTGCCGGT AGCGTTACTC
CAAGAAAATC CCGACGATGT CTTCGCGGGT GGTGATAACC TCCGCCGACA AATCGCTAAC
ATCACCCAAT TCGGTGTGCC GGTCGTTGTC GCGCTGAACA CCTATCCCGA AGATACTGCG
ACAGAGATCG AAGCCGTCGC CCAGATCGCT ACCGCAGCCG GTGCTGTCGG TATGGCCGTG
AGCAACGTTT ACGCTGCAGG TGGAGCGGGA GGTGTCGAAT TGGCTAAACT GGTGGCCCGT
GTCACCGAAC GACCCGGCCC GCGCGAACCG AAATATCTCT ACCCGCTCGA AATGCCATTA
GCCGAGAAGA TAGAAGTGAT CGCTCGCCGC ATCTATGGTG CAGCCGGCAT TGAGCTGAGC
GCGACGGCTG CTGCGCAACT GGCAACCTTG ACGGAAGCCG GGTTTGGCAA CTTGCCTATC
TGTATGGTGA AGACCCATCT CAGCCTCAGC CACGATCCCA AACTGCGCGG GGCACCAGCA
GGTTTCATTT TTCCTATTCG TGAGGTGCGG ATTAGCGCCG GTGCCGGGTT CATTTTACCC
ATTGCCGGCA CAACCGTGAC TATGCCGGGC TTGGGCGCGC ATCCGGCAGC CCATCAGGTT
GACATTGACG ATGATGGTAA CATCGTGGGT TTATTTTAA
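
The GC content field can be recomputed from the gene sequence as displayed. The following is a minimal sketch, assuming the sequence is pasted as shown, in space-separated blocks of ten bases across several lines. The function name `gc_percent` is hypothetical.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    # Drop the spaces and line breaks used to lay out the blocks of ten.
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First two blocks of the gene sequence, used here only as a short example.
print(gc_percent("ATGAAAACCA GCCTGCAAAT"))
```

Passing the full gene sequence to the function gives the value for the whole gene, and rounding it to a whole percent allows a comparison with the GC content field.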
 
Protein sequence
MKTSLQIAAE AHLEPIGVIA ERLGLPTEYL EPYGRYRGKI DLTFLDDHAN RPRGRYILVS 
AITPTPLGEG KTTTAIGLAM ALNRIGKRAV VTLRQSSLGP VFGIKGGGAG GGYSQIVPLA
ESILHLNGDI HAVSQAHNQL AALTDNSWYH GNPLDIDPDR IEIRRVVDVN DRFLRQVMIG
LGGKQNGFPR QTGFDISVAS ELMAILAMVS GAGAKAALRE LRARIGRMVV AFRRDGTPVT
AEDVRGAGAA TVLMREALKP NLMQTIENTP ALIHAGPFAN IAQGNSSILA DLVALRCAEY
TITEAGFGAD IGAEKFFNLK CRAGGLWPDA AVIVATVRAL KAHSGKYEIV AGKPLPVALL
QENPDDVFAG GDNLRRQIAN ITQFGVPVVV ALNTYPEDTA TEIEAVAQIA TAAGAVGMAV
SNVYAAGGAG GVELAKLVAR VTERPGPREP KYLYPLEMPL AEKIEVIARR IYGAAGIELS
ATAAAQLATL TEAGFGNLPI CMVKTHLSLS HDPKLRGAPA GFIFPIREVR ISAGAGFILP
IAGTTVTMPG LGAHPAAHQV DIDDDGNIVG LF
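
The protein sequence can be checked against the gene sequence by translating it with translation table 11. The following is a minimal sketch, assuming the NCBI ordering of codons (bases in the order T, C, A, G) and assuming the input is already in the coding orientation. Table 11 assigns the same amino acids to codons as the standard code and differs in its permitted start codons. This gene starts with ATG, so the sketch does not handle alternative starts. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and last line of the gene sequence shown above.
print(translate("ATGAAAACCA GCCTGCAAAT CGCTGCCGAA"))
print(translate("GACATTGACG ATGATGGTAA CATCGTGGGT TTATTTTAA"))
```

The two calls reproduce the first block (MKTSLQIAAE) and the final residues (DIDDDGNIVGLF) of the protein sequence in this record.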