Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cagg_0476 |
Symbol | |
ID | 7266644 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chloroflexus aggregans DSM 9485 |
Kingdom | Bacteria |
Replicon accession | NC_011831 |
Strand | + |
Start bp | 586292 |
End bp | 587152 |
Gene Length | 861 bp |
Protein Length | 286 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 643565339 |
Product | nitrogen-fixing NifU domain protein |
Protein accession | YP_002461853 |
Protein GI | 219847420 |
COG category | [O] Posttranslational modification, protein turnover, chaperones [P] Inorganic ion transport and metabolism [R] General function prediction only |
COG ID | [COG0694] Thioredoxin-like proteins and domains [COG2146] Ferredoxin subunits of nitrite reductase and ring-hydroxylating dioxygenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.140105 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.34365 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTCAAA CTGTACCCGA CGATACCGGT TTGCTCGAAC AAGCCGCCGC TCGCGTTGAC GCAGCGGTAG CGGCAGCAAA TAAACTCGAA CCGACAGCTC AAACCGTCGC TACCGAACTC AAACACGCCA TTGAGGCTTT TCACAAACTC GCCCTGAATA CTATCGTGCG GCGATTGAAG CAAGACCCCC ACGGCAAAGC AATCTTATTT GAGTTGGTTG AAGACCCCGC CGTGTACGCG CTCTTGCTGA TGCACGGTAT TGTGCGCGCC GACCCCGTCA CCCGCGCCCG TCGCGTACTT GATAACGCAC GCCCGTATAT GCAGTCGCAC GGTGGAGACG CCGAATTGGT TGATGTGCGC GACGGCGTGG CTTACGTGCG CCTACACGGT TCGTGCAATG GTTGTTCGCT CTCAGCCTTT ACCCTACGCA AACACGTCGA AGAGGCCCTG TTACGTGAAG TACCGGAAAT GACCCGCCTT GAGGTAGTAA CCGACCAGGC CACGCCCGCG ATCCTCCGTG CGGAAGCACA AGAAATGCCT GCCGTCGAAA AAGGTTGGGT ACGTGGCCCT GCCGTCACCG AGGTTCCGCC CGGTCAGATG GTGAGTATCA CAACCGAACG TGGCAGTGTC CTCATTGTCA ATTTTGCCAA CCGACTTAGC GCCTATCGCA ACGCCTGTGC GCACCAAGGC CGCCCGCTCA ACGATGGAAT ACTTGATCCA ATTACCGGTA CGCTCACCTG TCGGTGGCAT GGCTTCTGTT TCGATCTGCA AAGCGGAGAA TGCCTGACTG CACCGCAAGC GCAGCTTGAA CCATTCCCCT TACGAGTAGT TGACGGCATC ATTTGGGTAC GACCGCAATG A
|
Protein sequence | MTQTVPDDTG LLEQAAARVD AAVAAANKLE PTAQTVATEL KHAIEAFHKL ALNTIVRRLK QDPHGKAILF ELVEDPAVYA LLLMHGIVRA DPVTRARRVL DNARPYMQSH GGDAELVDVR DGVAYVRLHG SCNGCSLSAF TLRKHVEEAL LREVPEMTRL EVVTDQATPA ILRAEAQEMP AVEKGWVRGP AVTEVPPGQM VSITTERGSV LIVNFANRLS AYRNACAHQG RPLNDGILDP ITGTLTCRWH GFCFDLQSGE CLTAPQAQLE PFPLRVVDGI IWVRPQ
|
| |