Gene Mmar10_3041 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmar10_3041 
SymbolnusA 
ID4284260 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMaricaulis maris MCS10 
KingdomBacteria 
Replicon accessionNC_008347 
Strand
Start bp3325753 
End bp3327471 
Gene Length1719 bp 
Protein Length572 aa 
Translation table11 
GC content61% 
IMG OID638142537 
Producttranscription elongation factor NusA 
Protein accessionYP_758260 
Protein GI114571580 
COG category[K] Transcription 
COG ID[COG0195] Transcription elongation factor 
TIGRFAM ID[TIGR01953] transcription termination factor NusA
[TIGR01954] transcription termination factor NusA, C-terminal duplication 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones61 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCATTG GTGTCAGTGC AAACCGGCTG GAATTGCTGC AGATCGCCCG TGCGGTCGCG 
GCAGAAAAGT CCATCGATGA ATCGATCGTG ATCGAGGCCA TCGAGGAAGC CATCCAGAAG
GCCGCGCGCT CCCGCTATGG GGCCGAAAAC GACATCCGCG CCAAGATCGA CCCGAAAACG
GGCGAGCTGT CGCTGACCCG CAACATGACG GTTGTGGAAG AAGTCGAGAA TGACAGCCAG
GAGCTGACCC TCGCCGACGC CAAGAAAATC GACAAGACCG CCGAGATCGG CACGGTCTTC
TCCGACGAGC TGCCGCCGAT CGAATTTGGT CGTGTGGCCT CGCAGACGGC CAAGCAGGTC
ATCACGCAAA AGGTCCGTGA AGCCGAGCGT CAGCGCCAGT TCGAAGAGTA CAAGGACCGT
GTCGGCGAGA TCATTTCCGG CATCGTCAAG CGCGTCGAAT ATGGCAATGT GATCATCGAT
CTCGGTCGTG CCGAGGCGAT CATCCGCCGT GCCGATGGCA TCCCGCGCGA GAATCTGCAG
AACAATGAAC GTGTCCGCGC CTACATCTAT GATGTGCGGG AAGAAGTTCG TGGCCCGCAG
ATATTCCTGT CGCGCGCTCA TCCTGACTTC ATGGCCGCCC TGTTCGCCCA GGAAGTGCCG
GAAGTCTATG AGGGCATCAT CGAGATCCCG TCGGTCGCAC GCGACCCGGG TTCGCGCGCC
AAGATCGCCG TCATCTCGAA TGATGGCTCC ATTGATCCGG TCGGTGCCTG TGTCGGTATG
CGCGGCTCGC GTGTTCAGGC GGTGGTGTCC GAGCTGGCTG GCGAGAAGAT CGACATCATC
CCGTGGTCGG ATGATCCGGC GACCTTCATC GTCAACGCGT TGCAGCCGGC CGAAGTGGCC
AAGGTCGTCC TCGACGAGGA AGATCAGCGC ATCGAGGTTG TCGTGCCGGA TGAGCAGCTG
TCGCTGGCCA TTGGTCGTCG CGGCCAGAAT GTCCGCCTCG CCTCGCAGCT GACCGGCTGG
TCGATCGACA TCCTGACCGA GGAAGAAGAG TCCGAGCGTC GCCAGAAGGA ATTCGCCGAG
CGGACCCAGA TCTTTATCGC TGCCCTTGAT GTTGATGAAG TCATTGCCCA GCTTCTGGCG
ACGGAAGGCT TCACCGATGT TGAAGACCTT GCCTATGCCG ATCTCGGCGA AATCGGCGCG
ATCGAAGGCT TCGACGAGGA CACGGCTGAG GAAATCCAGG CCCGCGCCCG CGATTACCTG
GACCGCCTGT CGGCCGAGCA GGACGCCAAG CGCAAGGAGC TGGGTGTCGA GGATGCAGTG
CTTGAGGTCG AGGGTGTTGT CCTCGCCATG GCTGTGAAAT TCGGTGAGAA CGACGTCAAG
ACGGTCGACG ATCTGGCTGG CCTGGTCACC GATGACCTTC GTGGCTGGTT TGAAACCAAG
AATGGCGAGC GCGTGCGCGA GCCGGGCATA CTGGAAGAGT TCAATCTCGC CGCTGAAGAT
GCCGAGATGA TGATCATGCG GGCTCGCGTT GCGGCTGGCT GGATCTCGGA AGAGGACCTG
CCGCAGCCGG AAGTCGTCGA GGAGGAGGTC GATGAGGCCG CCGCTGCCTT CGACGTCGAG
AACATGGATC TGGCGGCTCT GGAAGCCGAA GCCGAGGCCC TTGGTCTCGA CCTCGACGCC
GAGCTGCCTG AAGAGGGCGA GGAGGCGAAG GCTGACTAG
 
Protein sequence
MSIGVSANRL ELLQIARAVA AEKSIDESIV IEAIEEAIQK AARSRYGAEN DIRAKIDPKT 
GELSLTRNMT VVEEVENDSQ ELTLADAKKI DKTAEIGTVF SDELPPIEFG RVASQTAKQV
ITQKVREAER QRQFEEYKDR VGEIISGIVK RVEYGNVIID LGRAEAIIRR ADGIPRENLQ
NNERVRAYIY DVREEVRGPQ IFLSRAHPDF MAALFAQEVP EVYEGIIEIP SVARDPGSRA
KIAVISNDGS IDPVGACVGM RGSRVQAVVS ELAGEKIDII PWSDDPATFI VNALQPAEVA
KVVLDEEDQR IEVVVPDEQL SLAIGRRGQN VRLASQLTGW SIDILTEEEE SERRQKEFAE
RTQIFIAALD VDEVIAQLLA TEGFTDVEDL AYADLGEIGA IEGFDEDTAE EIQARARDYL
DRLSAEQDAK RKELGVEDAV LEVEGVVLAM AVKFGENDVK TVDDLAGLVT DDLRGWFETK
NGERVREPGI LEEFNLAAED AEMMIMRARV AAGWISEEDL PQPEVVEEEV DEAAAAFDVE
NMDLAALEAE AEALGLDLDA ELPEEGEEAK AD