Gene P9301_16831 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9301_16831 
SymbolnusA 
ID4912096 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9301 
KingdomBacteria 
Replicon accessionNC_009091 
Strand
Start bp1414567 
End bp1415970 
Gene Length1404 bp 
Protein Length467 aa 
Translation table11 
GC content38% 
IMG OID640161280 
Producttranscription elongation factor NusA 
Protein accessionYP_001091907 
Protein GI126697021 
COG category[K] Transcription 
COG ID[COG0195] Transcription elongation factor 
TIGRFAM ID[TIGR01953] transcription termination factor NusA 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.422838 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCATTAG TTATTCTCCC AGGTTTAAAC AATCTTATTG AAGACATTAG TGAGGAAAAA 
AAATTACCTC CTAATATCGT TGAAGCAGCC TTGCGCGAAG CTTTGTTAAA GGGATATGAA
AAATATAGAA GAACTTTTTA CATTGGAGTT AACGAAGATC CATTTGATGA AGAGTACTTC
AGTAATTTTG ATGTTGGACT AGATCTAGAT GAAGAGGGTT ACAGGATATT ATCTAGTAAA
ATAATTGTTG AAGAAGTAGA GAGCGAAGAT CATCAAATAT CTCTAATAGA AGTTAAACAA
GTTGCTGATG ATGCGCAAAT AGGTGACACA GTTGTATTAG ATGTCACTCC TGAAAAAGAG
GATTTTGGGC GAATGGCTGC TTCAACAACA AAGCAAGTTT TAGCACAAAA ATTAAGGGAT
CAACAACGAA AAATGATCCA AGAAGAATTT GCAGATTTGG AGGATCCTGT TTTAACTGCA
AGAGTTATCA GATTTGAAAG ACAATCAGTG ATTATGGGAG TTAGTTCGGG AATCGGCAGA
CCCGAGGTTG AAGCAGAACT TCCCAAGAGA GATCAATTAC CAAATGATAA CTATAGAGCA
AATGCAACTT TCAAAGTATT TTTAAAAGAA GTTAGCGAAA TTGCTAGAAA AGGTCCACAA
CTTTTTGTGA GTAGAGCTAA CGCTGGTTTA GTAGTTTATT TATTTGAAAA TGAAGTACCG
GAAATTCAGG AAGGTACAGT AAAAATAGTT GCTGTTTCAA GAGAAGCGAA TCCTCCTTCA
AGAGCTGTTG GGCCAAGAAC AAAAGTAGCT GTTGATAGCG TCGAAAATGA GGTCGACCCT
GTAGGTGCTT GTATTGGAGC GAGAGGAGCA AGAATCCAAC AAGTAGTTAA TGAATTAAGA
GGAGAAAAAA TTGATGTTAT TAAATGGTCA TCTGACCCAA TACAGTATAT TTTGAACTCC
TTAAGTCCGG CTAAAGTCGA TCTCGTGAGA CTTGTTGACC CTGAAGGTCA ACACGCGCAT
GTACTAGTTC CTCCTGATCA ATTAAGTCTC GCAATTGGTA GAGAAGGACA AAATGTAAGA
CTTGCGGCAA GATTAACTGG TTGGAAGATT GATGTTAAAA ACTCACATGA ATACGATCAG
GAAGCAGAAG ATGCTGCAGT CTCTGAATTA ATTATCCAAA GAGAAGATGA AGAGAAACTC
CAGCGTGAAG CTGAGCTTAG ATTAGAAGCA GAACAAGCTG AGCGTGCTGC GGAAGATGCA
AGATTAAGAG AGCTTTATCC CCTTCCCGAA GATGAAGAAG AATATGGAGA GGAACAATAC
GAAGGAGAAG AATTAACAGA TAATGATCCA TTAGAGACTC TTCAAGATAC TGACATATCT
GCCAAAGAGG AGAAAAAACG GTGA
 
Protein sequence
MALVILPGLN NLIEDISEEK KLPPNIVEAA LREALLKGYE KYRRTFYIGV NEDPFDEEYF 
SNFDVGLDLD EEGYRILSSK IIVEEVESED HQISLIEVKQ VADDAQIGDT VVLDVTPEKE
DFGRMAASTT KQVLAQKLRD QQRKMIQEEF ADLEDPVLTA RVIRFERQSV IMGVSSGIGR
PEVEAELPKR DQLPNDNYRA NATFKVFLKE VSEIARKGPQ LFVSRANAGL VVYLFENEVP
EIQEGTVKIV AVSREANPPS RAVGPRTKVA VDSVENEVDP VGACIGARGA RIQQVVNELR
GEKIDVIKWS SDPIQYILNS LSPAKVDLVR LVDPEGQHAH VLVPPDQLSL AIGREGQNVR
LAARLTGWKI DVKNSHEYDQ EAEDAAVSEL IIQREDEEKL QREAELRLEA EQAERAAEDA
RLRELYPLPE DEEEYGEEQY EGEELTDNDP LETLQDTDIS AKEEKKR