Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_0031 |
Symbol | nusA |
ID | 5897743 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 36381 |
End bp | 38087 |
Gene Length | 1707 bp |
Protein Length | 568 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641560514 |
Product | transcription elongation factor NusA |
Protein accession | YP_001681667 |
Protein GI | 167644004 |
COG category | [K] Transcription |
COG ID | [COG0195] Transcription elongation factor |
TIGRFAM ID | [TIGR01953] transcription termination factor NusA [TIGR01954] transcription termination factor NusA, C-terminal duplication |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.351795 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTATCG GCATCTCCGC CAACCGCCTC GAGCTGCTGC AGATCGCCGA CGCGGTCGCG CGTGAAAAAG GCATCGAGAA GGAAGTCGTC ATCGAGGCGA TCGAGGACGC CCTGCAGAAG GCCGCCCGCG CTCGCTACGG CGCCGAGCAC GACATCCGCG TGAAGATCGA CACCAAGACC GGCGAGACCA CCCAGAAGCG GGTGATCGAG GTCGTGCCGG ACGACTTCGA GCTGGAAGGC GAGATCGGCA AGGTTCAGCT GTCGTCGGCC AAGCGCACCT GGCGCGACGC CGAGGTCGGC AAGATCTACG AGGAAAGCCT GCCGCCGTTC GAGATCGGCC GCGTCCAGAC CCAGATGGCC CGCCAGGTCG TCATGCATAA GGTCCGCGAA GCCGAGCGCG AGCGCCAGTA CGACGAGTAC AAGGATCGCG CCGGCGAGAT CGTCAACGGC AGCGTCAAGC GCGTCGAATA CGGCAACGTC ATCGTCGACC TGGGCCGCGG CGAAGGCATC ATGCGCCGCG ACCAGTCGAT CCCGCGCGAG AATTTCAACG TCGGCGACCG CATCCGCGCC TACATCTACG ACGTCCGTCG CGAGACCAAG GGCCCGCAGA TCATGCTCAG CCGCGCCCAC GGCGGCTTCA TGGCCAAGCT GTTCGCGCAG GAAGTGCCGG AAGTCTATGA CGGCGTCATC GAGATCCGCG CCGTGGCCCG CGACCCGGGC TCGCGCGCCA AGATGGCCGT GATCTCGAAC GACAGCAGCA TCGACCCCGT CGGCGCCTGC GTCGGCATGC GCGGTTCGCG CGTGCAGGCG GTGGTGGCCG AACTGCAGGG CGAGAAGATC GACATCATCC AGTGGTCCGA GGACGAGGCG ACCTTCATCG TCAACGCCCT GGCCCCGGCC GAAGTCTCCA AGGTCGTCAT GGACGAGGAA GACGAGCGCG TCGAAGTGGT GGTGCCCGAC GAGCAGCTGT CGCTGGCCAT CGGCCGCCGC GGCCAGAACG TCCGCCTGGC CTCGCAGCTG ACCGGCTGGC AGATCGACAT CATGACGGAA AGCCAGGAGA GCGAGCGCCG TCAGAAGCAG TTCACCGAGA CCACCGCCCT GTTCCAGGAA GCCCTGGACG TCGACGAGGT CATCGCCCAA CTGCTGGTCA CCGAGGGCTT CGCCACGGTG GAAGACGTCG CCTATGTCGA GCCGCACGAG ATCGCGGCCA TCGAGGGCTT CGACGACGAG ACCGCCGACG AATTGCAGAC CCGGGCCCGC GAATTCCTCG ACAAGGAAGC CGCCGCCCTC GACGCCAAGC GCGTCGAGTT GGGCGTCGAG GACGGCCTGC TCGAGATCGA AGGCGTCACC CTGCCCGTGG CCGTGGCCCT GGGCGAAGGC GACGTGAAGT CGGTCGAGGA CCTGGCGGGC CTGATCCCCG ACGACCTGCG CGGCTGGTTC GAGACCAAGG ACGGCGAGCG CACCCGCGAA GCCGGCATCC TCGACAGCTT CAACCTGTCG CCGGAAGACG CCGAGGCGCT GATCATGCGC GCGCGCGTCG TCATGGGTTG GGTCGAGGCT CCGCCGGAAC CGGAATATGT CGAGGAAGAA AGCGTTTATG CGGAAGAGGC GGGCGAAGAG CCTGCCGAGG CCTCGGACGA GATCGCCGAG GACGCGGAGC CGGTCGAAGA CACCGAGGAC GCGCCCGAAG AAACCACCGA AGACTGA
|
Protein sequence | MAIGISANRL ELLQIADAVA REKGIEKEVV IEAIEDALQK AARARYGAEH DIRVKIDTKT GETTQKRVIE VVPDDFELEG EIGKVQLSSA KRTWRDAEVG KIYEESLPPF EIGRVQTQMA RQVVMHKVRE AERERQYDEY KDRAGEIVNG SVKRVEYGNV IVDLGRGEGI MRRDQSIPRE NFNVGDRIRA YIYDVRRETK GPQIMLSRAH GGFMAKLFAQ EVPEVYDGVI EIRAVARDPG SRAKMAVISN DSSIDPVGAC VGMRGSRVQA VVAELQGEKI DIIQWSEDEA TFIVNALAPA EVSKVVMDEE DERVEVVVPD EQLSLAIGRR GQNVRLASQL TGWQIDIMTE SQESERRQKQ FTETTALFQE ALDVDEVIAQ LLVTEGFATV EDVAYVEPHE IAAIEGFDDE TADELQTRAR EFLDKEAAAL DAKRVELGVE DGLLEIEGVT LPVAVALGEG DVKSVEDLAG LIPDDLRGWF ETKDGERTRE AGILDSFNLS PEDAEALIMR ARVVMGWVEA PPEPEYVEEE SVYAEEAGEE PAEASDEIAE DAEPVEDTED APEETTED
|
| |