Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cagg_3683 |
Symbol | |
ID | 7268218 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chloroflexus aggregans DSM 9485 |
Kingdom | Bacteria |
Replicon accession | NC_011831 |
Strand | - |
Start bp | 4475838 |
End bp | 4477166 |
Gene Length | 1329 bp |
Protein Length | 442 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 643568489 |
Product | NusA antitermination factor |
Protein accession | YP_002464955 |
Protein GI | 219850522 |
COG category | [K] Transcription |
COG ID | [COG0195] Transcription elongation factor |
TIGRFAM ID | [TIGR01953] transcription termination factor NusA |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00388019 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAGCG ATTTTTACGC GGCAATTACT CAGATTGCGT CTGAACGCGG TATTCCAAAA GAGGCCATCA TTGACGTGAT GGAACGGGCA TTGGTGAGCG CATACAAGCG CTTACTCGGC CCCAATCCAC CGGCGGTAGA AGTGACGGTG AAGCTCGATC CCGTCAGCGG TATGGCGCGG GTCTACGCCG AGAAGCAAGT GGTCGATGAA GTGTACGACG AGCGGTTTGA GATCGACCTC AACAGTGCGC GCCAAATCAA GCCCGACGTT CAGATCGGCG AGACGGTGCT GGTCGAGAGC ACACCGCGTG ATTTTGGCCG GATCGCCGCC CAAACTGCGA AGCAAGTGGT GTTGCAAGGG ATTAAAGAGA TCGAACGGAG CTATATCTAT AGCGAATTTG AAGATCGCGA AGGCGAGTTG GTGACGGCGA CGGTACAGCG CAACAACGGG CCACGCGGCA ACGTGATCCT TGAGATAGGG AAGGCCGAGG CCATTATCCC GCCGAAAGAG CAAGTGGCAA ACGATCGCTA TTACCACGGC CAGCGGCTCA AGGTATTGCT GCTCGAGGTC AAGAAGGATG ATCGCGGCCC TCGCCTGATT GCGTCGCGCG CGCACAAGAA TCTGATCAAC CGCCTGTTTG AGATGGAAGT TCCCGAAATC TACAACGGGT CGGTTGAGAT TAAGTCGGTG GCCCGTGAAC CCGGTCTGCG CACGAAAGTC GCTGTTGCGG CCCGCCAAGA GGGGATCGAT CCGGTCGGGT CATGCGTCGG CATGCGTGGT ATCCGCATTC AGAATATTGT TAATGAGCTA AACGGCGAAA AGATCGATGT GGTGCAGTGG TCGTCGGATC CACGTGAGTA TATCGCCAAC GCCCTCTCGC CGGCACAAGT GGTTGAGGTG CATCTCCACG ATCACGACCA TACCGCGCTG GTGATTGTCC CCGACAAACA ACTTTCGCTG GCAATCGGTA AGGAAGGGCA GAACGTCCGG CTGGCAGCGA AGCTCACCGG TTGGCGGATC GATATTAAGA GTGCGTCGGC GTTGCTCGAA GAGGAACGGG CCGCTGCTGA GGCCCGCGCT GAAGCTGAGG CCGAACAGAT GGTGATTGCA GCCGAATTGG CGCATGCCCC GGTTGAACAG CGCATCGTAC AGGCTGACGG CACCATCCGC TACCGCGGTC ATCGCTACGG CCCACTCGGT AACGATTTGG TTGGACAGAC GGTGCAGGTA CGGGCAACAT CGCAAAAGAT CTTTATTTAC TATAATGACC GCCTGTTCGC CTCCTATATC TTGCTGGAAG GTGGACAGGC TTCGCCTGAT GAGGAGTAA
|
Protein sequence | MKSDFYAAIT QIASERGIPK EAIIDVMERA LVSAYKRLLG PNPPAVEVTV KLDPVSGMAR VYAEKQVVDE VYDERFEIDL NSARQIKPDV QIGETVLVES TPRDFGRIAA QTAKQVVLQG IKEIERSYIY SEFEDREGEL VTATVQRNNG PRGNVILEIG KAEAIIPPKE QVANDRYYHG QRLKVLLLEV KKDDRGPRLI ASRAHKNLIN RLFEMEVPEI YNGSVEIKSV AREPGLRTKV AVAARQEGID PVGSCVGMRG IRIQNIVNEL NGEKIDVVQW SSDPREYIAN ALSPAQVVEV HLHDHDHTAL VIVPDKQLSL AIGKEGQNVR LAAKLTGWRI DIKSASALLE EERAAAEARA EAEAEQMVIA AELAHAPVEQ RIVQADGTIR YRGHRYGPLG NDLVGQTVQV RATSQKIFIY YNDRLFASYI LLEGGQASPD EE
|
| |