Gene Caul_0031 details


Gene Information

Locus tag: Caul_0031
Symbol: nusA
ID: 5897743
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 36381
End bp: 38087
Gene Length: 1707 bp
Protein Length: 568 aa
Translation table: 11
GC content: 67%
IMG OID: 641560514
Product: transcription elongation factor NusA
Protein accession: YP_001681667
Protein GI: 167644004
COG category: [K] Transcription
COG ID: [COG0195] Transcription elongation factor
TIGRFAM ID: [TIGR01953] transcription termination factor NusA
            [TIGR01954] transcription termination factor NusA, C-terminal duplication
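
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 36381
END_BP = 38087
GENE_LENGTH_BP = 1707
PROTEIN_LENGTH_AA = 568


def span_length(start: int, end: int) -> int:
    """Feature length for 1-based, inclusive coordinates (an assumption)."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in an unspliced CDS, minus one for the stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons should hold if the record is internally consistent.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

With the values in this record, 38087 - 36381 + 1 gives 1707 bp, and 1707 / 3 - 1 gives 568 aa, so the listed lengths agree with the coordinates.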


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.351795
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTATCG GCATCTCCGC CAACCGCCTC GAGCTGCTGC AGATCGCCGA CGCGGTCGCG 
CGTGAAAAAG GCATCGAGAA GGAAGTCGTC ATCGAGGCGA TCGAGGACGC CCTGCAGAAG
GCCGCCCGCG CTCGCTACGG CGCCGAGCAC GACATCCGCG TGAAGATCGA CACCAAGACC
GGCGAGACCA CCCAGAAGCG GGTGATCGAG GTCGTGCCGG ACGACTTCGA GCTGGAAGGC
GAGATCGGCA AGGTTCAGCT GTCGTCGGCC AAGCGCACCT GGCGCGACGC CGAGGTCGGC
AAGATCTACG AGGAAAGCCT GCCGCCGTTC GAGATCGGCC GCGTCCAGAC CCAGATGGCC
CGCCAGGTCG TCATGCATAA GGTCCGCGAA GCCGAGCGCG AGCGCCAGTA CGACGAGTAC
AAGGATCGCG CCGGCGAGAT CGTCAACGGC AGCGTCAAGC GCGTCGAATA CGGCAACGTC
ATCGTCGACC TGGGCCGCGG CGAAGGCATC ATGCGCCGCG ACCAGTCGAT CCCGCGCGAG
AATTTCAACG TCGGCGACCG CATCCGCGCC TACATCTACG ACGTCCGTCG CGAGACCAAG
GGCCCGCAGA TCATGCTCAG CCGCGCCCAC GGCGGCTTCA TGGCCAAGCT GTTCGCGCAG
GAAGTGCCGG AAGTCTATGA CGGCGTCATC GAGATCCGCG CCGTGGCCCG CGACCCGGGC
TCGCGCGCCA AGATGGCCGT GATCTCGAAC GACAGCAGCA TCGACCCCGT CGGCGCCTGC
GTCGGCATGC GCGGTTCGCG CGTGCAGGCG GTGGTGGCCG AACTGCAGGG CGAGAAGATC
GACATCATCC AGTGGTCCGA GGACGAGGCG ACCTTCATCG TCAACGCCCT GGCCCCGGCC
GAAGTCTCCA AGGTCGTCAT GGACGAGGAA GACGAGCGCG TCGAAGTGGT GGTGCCCGAC
GAGCAGCTGT CGCTGGCCAT CGGCCGCCGC GGCCAGAACG TCCGCCTGGC CTCGCAGCTG
ACCGGCTGGC AGATCGACAT CATGACGGAA AGCCAGGAGA GCGAGCGCCG TCAGAAGCAG
TTCACCGAGA CCACCGCCCT GTTCCAGGAA GCCCTGGACG TCGACGAGGT CATCGCCCAA
CTGCTGGTCA CCGAGGGCTT CGCCACGGTG GAAGACGTCG CCTATGTCGA GCCGCACGAG
ATCGCGGCCA TCGAGGGCTT CGACGACGAG ACCGCCGACG AATTGCAGAC CCGGGCCCGC
GAATTCCTCG ACAAGGAAGC CGCCGCCCTC GACGCCAAGC GCGTCGAGTT GGGCGTCGAG
GACGGCCTGC TCGAGATCGA AGGCGTCACC CTGCCCGTGG CCGTGGCCCT GGGCGAAGGC
GACGTGAAGT CGGTCGAGGA CCTGGCGGGC CTGATCCCCG ACGACCTGCG CGGCTGGTTC
GAGACCAAGG ACGGCGAGCG CACCCGCGAA GCCGGCATCC TCGACAGCTT CAACCTGTCG
CCGGAAGACG CCGAGGCGCT GATCATGCGC GCGCGCGTCG TCATGGGTTG GGTCGAGGCT
CCGCCGGAAC CGGAATATGT CGAGGAAGAA AGCGTTTATG CGGAAGAGGC GGGCGAAGAG
CCTGCCGAGG CCTCGGACGA GATCGCCGAG GACGCGGAGC CGGTCGAAGA CACCGAGGAC
GCGCCCGAAG AAACCACCGA AGACTGA
 
Protein sequence
MAIGISANRL ELLQIADAVA REKGIEKEVV IEAIEDALQK AARARYGAEH DIRVKIDTKT 
GETTQKRVIE VVPDDFELEG EIGKVQLSSA KRTWRDAEVG KIYEESLPPF EIGRVQTQMA
RQVVMHKVRE AERERQYDEY KDRAGEIVNG SVKRVEYGNV IVDLGRGEGI MRRDQSIPRE
NFNVGDRIRA YIYDVRRETK GPQIMLSRAH GGFMAKLFAQ EVPEVYDGVI EIRAVARDPG
SRAKMAVISN DSSIDPVGAC VGMRGSRVQA VVAELQGEKI DIIQWSEDEA TFIVNALAPA
EVSKVVMDEE DERVEVVVPD EQLSLAIGRR GQNVRLASQL TGWQIDIMTE SQESERRQKQ
FTETTALFQE ALDVDEVIAQ LLVTEGFATV EDVAYVEPHE IAAIEGFDDE TADELQTRAR
EFLDKEAAAL DAKRVELGVE DGLLEIEGVT LPVAVALGEG DVKSVEDLAG LIPDDLRGWF
ETKDGERTRE AGILDSFNLS PEDAEALIMR ARVVMGWVEA PPEPEYVEEE SVYAEEAGEE
PAEASDEIAE DAEPVEDTED APEETTED
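
The sequences above are printed in blocks of ten characters, so the spaces and line breaks have to be removed before any analysis. The following is a minimal sketch of how the gene sequence could be cleaned, translated, and checked against the protein sequence. It assumes the amino acid assignments of the standard genetic code, which translation table 11 shares (the two tables differ in permitted start codons). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and only the first line of each sequence is used for brevity.

```python
from itertools import product

# Codon table built in TCAG order; amino acid assignments of the standard
# genetic code, which translation table 11 also uses.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an already cleaned nucleotide sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1; a trailing partial codon is ignored."""
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


if __name__ == "__main__":
    # First line of the gene sequence and first two blocks of the protein.
    gene_start = clean(
        "ATGGCTATCG GCATCTCCGC CAACCGCCTC GAGCTGCTGC AGATCGCCGA CGCGGTCGCG"
    )
    protein_start = clean("MAIGISANRL ELLQIADAVA")
    print(translate(gene_start) == protein_start)
    print(round(gc_fraction(gene_start), 3))
```

Running the same functions on the full gene sequence would be needed to compare against the 67% GC content listed in the record; the fragment used here is only the first 60 bases.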