Gene Cagg_3067 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_3067 
Symbol 
ID7269484 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp3728965 
End bp3730434 
Gene Length1470 bp 
Protein Length489 aa 
Translation table11 
GC content60% 
IMG OID643567887 
Productpolynucleotide adenylyltransferase/metal dependent phosphohydrolase 
Protein accessionYP_002464361 
Protein GI219849928 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0617] tRNA nucleotidyltransferase/poly(A) polymerase 
TIGRFAM ID[TIGR00277] uncharacterized domain HDIG 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.263831 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000000430788 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCGATGT ATCCGAGCGA ATTGCAAACA ACCGATCTCT TCCAGTTCAT CGCCATGCAC 
AGTGACCAAC CGGTGTGGCT GGTGGGCGGT TCGGTGCGCG AATTGCTGGC CGGCCGCCAG
CCAGCCGATA TTGATCTTGC CGTCGCCGGG AGCGGGCTGG ACTTGGCGAA AACGTTAGCT
ACAGCCGGGG GTGGGACATT CGTTGCGCTC GACGACGAGC GAGACACCGG TCGGGCCGTC
TTACCCGGTG GGGAGACTAT CGATTGTGCA AGATTGCGTG CAGCCGACAT CATTGGTGAT
CTCCGACTAC GCGACTTTAC GATCAATGCG TTGGCACTCC CGCTAGCGGC AGCCATTCGA
GGTGACTGGC ATGATCTCAT CGATCCACTT GGCGGACAAG CCGATCTGGC TGCCGGTCGT
TTGCGTCTCT GTCTGCCCAC CGGTTTGCGT GAAGATCCGT TACGAGTCGT GCGCGCCGGA
CGTTTTCGGT CTACCCACCA TCTCACACCC GATCCCGAAC TCATCACGAT GGCACAACAG
GCAGCACCTG CGCTCGCTAC GGTCGCAGTA GAACGGATTC GCGACGAGAT TCTCAAATTG
TGTGATGGTT CGGCAGCAGC AGCCGGCTTG CGCTTCCTCG ACGAAGTCAG GGCGCTCACC
GTCATCTTCC CCGAACTCGA AGCAGCGCGC GACTGTGAAC AACCGTATGT GCATTTTCTC
CCGGTATTGG CGCACATCTT GGAAACGGTG GCTGCCCTTG ACTGGCTGAT CGACAACGGT
GAACCACCGG TGGCGGTACA AACCAACCCG CACTTAAGCC GCCGGTTACC CTTTGCCGAG
CGGTATCACG AGCACCTTCA CCGGCGGCGA GGAATAGTAC GGCGGGCAGC ACTGCTCAAA
CTCGCGGCCC TCCTTCACGA CAACGCCAAA CCGCAGACAA AGGTTCATCA CCCTGACGGT
ACGGTTACCT TCTACGGCCA CCAAAGCCTG GGCGCCGAAG TGGCTGCTCA GATCGGCAAA
CGGCTCCGCC TCAGTCGCAC CGACACAGCT TATATCGTGA CGATTGTTCG TGAACACATG
CGACCGGGGC AGATGCGGAG CGGTGGGCAA CTAACCGAAC GCGGGATCAA CCGCTTCTTT
CGCGATACCG GCGATGCCGG GCCAGATGTG TTGTTACATG AACTAGCCGA TCATCTAGCG
ACCCGCGGCC CGTGGCTCGA TCCGAGCGCA TGGCACAACC ATCTTGTGTG GGTTGGCGAG
CTGCTAGACC GTTATTGGAA TGCTCCAACG CCACCGCCAC CACCCCTGCT GCGTGGTGAT
GAGCTGATGG CAAGCTTGGG GATCGGCCCA GGGCCGGAAG TGGGCCGATT GCTGCGATTG
ATTCTTGAGG CACAACAGGC TGGTGAAATT CACAATCTCG AAGAGGCGCT CACGCTGGCA
CGAACACTCC ATCGCGCAGA GCGCTTGTAG
 
Protein sequence
MAMYPSELQT TDLFQFIAMH SDQPVWLVGG SVRELLAGRQ PADIDLAVAG SGLDLAKTLA 
TAGGGTFVAL DDERDTGRAV LPGGETIDCA RLRAADIIGD LRLRDFTINA LALPLAAAIR
GDWHDLIDPL GGQADLAAGR LRLCLPTGLR EDPLRVVRAG RFRSTHHLTP DPELITMAQQ
AAPALATVAV ERIRDEILKL CDGSAAAAGL RFLDEVRALT VIFPELEAAR DCEQPYVHFL
PVLAHILETV AALDWLIDNG EPPVAVQTNP HLSRRLPFAE RYHEHLHRRR GIVRRAALLK
LAALLHDNAK PQTKVHHPDG TVTFYGHQSL GAEVAAQIGK RLRLSRTDTA YIVTIVREHM
RPGQMRSGGQ LTERGINRFF RDTGDAGPDV LLHELADHLA TRGPWLDPSA WHNHLVWVGE
LLDRYWNAPT PPPPPLLRGD ELMASLGIGP GPEVGRLLRL ILEAQQAGEI HNLEEALTLA
RTLHRAERL