Gene Cagg_0767 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_0767 
Symbol 
ID7268086 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp951939 
End bp953009 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content55% 
IMG OID643565618 
Productdeoxyguanosinetriphosphate triphosphohydrolase-like protein 
Protein accessionYP_002462127 
Protein GI219847694 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0232] dGTP triphosphohydrolase 
TIGRFAM ID[TIGR01353] deoxyguanosinetriphosphate triphosphohydrolase, putative 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.762459 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCCAAC CCTCGATACG TGAAATGCAC GAGGCGCTCG AAGCGCGCAT CCTCTCACCT 
TACGCCGCCA AAAGCGCTGC TGCCGTGCGC GACCAACCGG AACCGCCATG TCCGATCCGT
ACCGCTTACC AGCGTGACCG TGACCGTATT TTGCACTCCA AACCGTTTCG TCGGCTTAAA
CACAAAACAC AGGTATTTAT CGCACCCCTC GGTGACCACT ACCGTACTCG CCTGACCCAT
ACCCTCGAAG TGACGCAAAT TGCTCGCACG GTGGCGCGTG CTCTGCGGCT TAATGAAGAC
CTGACCGAAG CGATCGGTCT TGGGCACGAC ATTGGTCATG CTCCCTTCGG GCATGCCGGT
GAGACGGCGC TGAGTCGGAT CTGCCCCGGT CACTTTCGCC ACAACGAACA ATCACTGCGC
ATTGTGGAAG TCCTTGAAAA CGGGGGAGCC GGCCTGAATC TCACGTTTGC GGTGCGCGAG
GGCATCTATA TGCACTCAAA GGTGCAGCGC GACATCACCG CTAAAGCCTG GGGGATAGCC
AGCACACTTG AAGGTCAGAT CATTAAAATC TGCGATAGTA TCGCCTATAT CAACCACGAT
ATTGACGATG CAATACGTGC CGGCATTCTA CGAACCGAAG ACTTACCTGC CGATTGCATT
GCCATCCTCG GCGACACCCA TAGCAAACGA CTGGCCACGA TGGTTAGTGA CATGATCTAC
CACAACTGGT GGGCAACCGG CGAGGGAACG GCTCCTGATA CCCTTACGCT ATCGATGAGT
CCGACTATCT TAGCTGCCAC CAACAAACTG CGTCATTTTC TGTATGAGAC GGTCTACCAC
CGGCCAGAAG CCAAAGCCGA GAATGAAAAG GTTCGTTTCA TTATCGAAAC GCTGTACGAC
TATTTTGTGC GCCATCCCGA AGCGATCCCG GCTGAACTGA TGGCAGTCGT TGAACGGCGA
GGCGAACCGG TTGAACAAGC GGTTGTCGAT TACATTGCCG GTATGACCGA CCGGTACGCA
CTCACCGTCT TCAAACGTAT CTTCGTACCC CGCACGTGGG GTACGCTCTA G
 
Protein sequence
MSQPSIREMH EALEARILSP YAAKSAAAVR DQPEPPCPIR TAYQRDRDRI LHSKPFRRLK 
HKTQVFIAPL GDHYRTRLTH TLEVTQIART VARALRLNED LTEAIGLGHD IGHAPFGHAG
ETALSRICPG HFRHNEQSLR IVEVLENGGA GLNLTFAVRE GIYMHSKVQR DITAKAWGIA
STLEGQIIKI CDSIAYINHD IDDAIRAGIL RTEDLPADCI AILGDTHSKR LATMVSDMIY
HNWWATGEGT APDTLTLSMS PTILAATNKL RHFLYETVYH RPEAKAENEK VRFIIETLYD
YFVRHPEAIP AELMAVVERR GEPVEQAVVD YIAGMTDRYA LTVFKRIFVP RTWGTL