Gene Cagg_0476 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_0476 
Symbol 
ID7266644 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp586292 
End bp587152 
Gene Length861 bp 
Protein Length286 aa 
Translation table11 
GC content57% 
IMG OID643565339 
Productnitrogen-fixing NifU domain protein 
Protein accessionYP_002461853 
Protein GI219847420 
COG category[O] Posttranslational modification, protein turnover, chaperones
[P] Inorganic ion transport and metabolism
[R] General function prediction only 
COG ID[COG0694] Thioredoxin-like proteins and domains
[COG2146] Ferredoxin subunits of nitrite reductase and ring-hydroxylating dioxygenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.140105 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.34365 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTCAAA CTGTACCCGA CGATACCGGT TTGCTCGAAC AAGCCGCCGC TCGCGTTGAC 
GCAGCGGTAG CGGCAGCAAA TAAACTCGAA CCGACAGCTC AAACCGTCGC TACCGAACTC
AAACACGCCA TTGAGGCTTT TCACAAACTC GCCCTGAATA CTATCGTGCG GCGATTGAAG
CAAGACCCCC ACGGCAAAGC AATCTTATTT GAGTTGGTTG AAGACCCCGC CGTGTACGCG
CTCTTGCTGA TGCACGGTAT TGTGCGCGCC GACCCCGTCA CCCGCGCCCG TCGCGTACTT
GATAACGCAC GCCCGTATAT GCAGTCGCAC GGTGGAGACG CCGAATTGGT TGATGTGCGC
GACGGCGTGG CTTACGTGCG CCTACACGGT TCGTGCAATG GTTGTTCGCT CTCAGCCTTT
ACCCTACGCA AACACGTCGA AGAGGCCCTG TTACGTGAAG TACCGGAAAT GACCCGCCTT
GAGGTAGTAA CCGACCAGGC CACGCCCGCG ATCCTCCGTG CGGAAGCACA AGAAATGCCT
GCCGTCGAAA AAGGTTGGGT ACGTGGCCCT GCCGTCACCG AGGTTCCGCC CGGTCAGATG
GTGAGTATCA CAACCGAACG TGGCAGTGTC CTCATTGTCA ATTTTGCCAA CCGACTTAGC
GCCTATCGCA ACGCCTGTGC GCACCAAGGC CGCCCGCTCA ACGATGGAAT ACTTGATCCA
ATTACCGGTA CGCTCACCTG TCGGTGGCAT GGCTTCTGTT TCGATCTGCA AAGCGGAGAA
TGCCTGACTG CACCGCAAGC GCAGCTTGAA CCATTCCCCT TACGAGTAGT TGACGGCATC
ATTTGGGTAC GACCGCAATG A
 
Protein sequence
MTQTVPDDTG LLEQAAARVD AAVAAANKLE PTAQTVATEL KHAIEAFHKL ALNTIVRRLK 
QDPHGKAILF ELVEDPAVYA LLLMHGIVRA DPVTRARRVL DNARPYMQSH GGDAELVDVR
DGVAYVRLHG SCNGCSLSAF TLRKHVEEAL LREVPEMTRL EVVTDQATPA ILRAEAQEMP
AVEKGWVRGP AVTEVPPGQM VSITTERGSV LIVNFANRLS AYRNACAHQG RPLNDGILDP
ITGTLTCRWH GFCFDLQSGE CLTAPQAQLE PFPLRVVDGI IWVRPQ