Gene A9601_16961 details


Gene Information

Locus tag: A9601_16961
Symbol: nusA
ID: 4718426
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. AS9601
Kingdom: Bacteria
Replicon accession: NC_008816
Strand:
Start bp: 1438788
End bp: 1440191
Gene Length: 1404 bp
Protein Length: 467 aa
Translation table: 11
GC content: 37%
IMG OID: 640079422
Product: transcription elongation factor NusA
Protein accession: YP_001010086
Protein GI: 123969228
COG category: [K] Transcription
COG ID: [COG0195] Transcription elongation factor
TIGRFAM ID: [TIGR01953] transcription termination factor NusA
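
The coordinate and length fields above are related by simple arithmetic. The sketch below is an illustration only: it assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS whose last codon is a stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length_bp(1438788, 1440191)
print(length)                     # 1404
print(protein_length_aa(length))  # 467
```

Under these assumptions the results agree with the Gene Length and Protein Length values listed for this record.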


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0339666
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCATTAG TTATTCTCCC AGGTTTAAAC AATCTCATTG AAGATATTAG TGAGGAAAAA 
AAGTTACCTC CTAATATCGT GGAATTAGCC TTACGCGAAG CTTTATTAAA AGGATATGAA
AAATATAGAA AAACTTTTTA CATTGGAGTT AACCAAGATC CATTTGATGA AGAATATTTT
AGTAATTTTG ATGTTGGACT AGATCTAGAT GAAGAAGGTT ACAGGATATT ATCAAGTAAA
ATTATTGTAG AAGAAGTTGA GAGCGAAGAT CATCAAATAT CTCTAGTAGA AGTTAAGCAA
GTCGCTGATG ATGCTCAAAT AGGTGACACA GTTGTTTTAG ACGTTACTCC AGAAAAAGAG
GATTTTGGGC GAATGGCTGC TTCAACAACA AAGCAAGTTT TAGCCCAAAA GTTAAGAGAT
CAACAACGAA AAATGATCCA GGAAGAATTT GCGGATTTGG AAGATCCTGT TTTAACGGCA
AGAGTTATAA GATTTGAAAG ACAATCAGTC ATTATGGGAG TTAGTTCGGG TATTGGTAGA
CCTGAAGTTG AGGCCGAACT TCCCAAGAGA GATCAATTAC CAAATGATAA TTATAGAGCA
AATGCAACTT TTAAAGTATT TTTGAAAGAA GTTAGCGAAA TTGCCAGAAA AGGGCCGCAA
CTTTTTGTAA GTAGAGCAAA TGCTGGTTTA GTGGTTTATT TATTTGAAAA TGAAGTACCG
GAAATTCAAG AAGGTACAGT GAAAATTGTT GCTGTTTCAA GAGAAGCCAA CCCTCCTTCA
AGAGCTGTTG GGCCAAGAAC AAAAGTAGCT GTTGATAGTG TCGAAGAAGA AGTGGACCCT
GTAGGTGCAT GTATTGGAGC TAGAGGAGCA AGAATTCAAC AAGTAGTAAA TGAATTAAGG
GGTGAAAAAA TTGATGTTAT TAAATGGTCA TCTAACCCAA TACAGTATAT TTTAAACTCT
TTAAGTCCTG CGAAAGTAGA TCAAGTAAGA CTTGTAGACC CAGCAGGGCA ACATGCGCAC
GTACTAGTTC CTCCTGATCA ATTAAGTCTC GCAATTGGTA GAGAAGGTCA AAATGTAAGA
CTTGCCGCAA GATTAACTGG TTGGAAGATT GACGTTAAAA ACTCACATGA ATACGATCAG
GAAGCAGAAG ATGCTGCGGT CTCTGAATTA ATTATTCAAA GGGAAGATGA AGAGAATCTC
CAGAGAGAAG CTGAATTAAG ATTAGAAGCA GAACAAGCTG AGCGTGCTGC AGAAGATGCG
AGATTAAGAG AGCTTTATCC TCTTCCCGAA GATGAAGAAG AATATGGAGA GGAACAATAC
GAAGGAGTAG AATTCACAGA TAATGATCCA TTAGAGACTG TTCAAGATAC TGAGACATCT
GCCAAAGAGG AGAAAAAACG GTGA
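
The gene sequence is displayed in space-separated blocks of 10 bases, so it has to be cleaned before any computation. A minimal sketch of how the GC content field could be recomputed from such text is shown below. The helper names `clean_sequence` and `gc_percent` are hypothetical, and how the database rounds its reported percentage is not stated in the record.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, copied as displayed (six blocks of 10 bases).
first_line = "ATGGCATTAG TTATTCTCCC AGGTTTAAAC AATCTCATTG AAGATATTAG TGAGGAAAAA"
seq = clean_sequence(first_line)
print(len(seq))                 # 60
print(round(gc_percent(seq)))   # GC percentage of this 60-base fragment only
```

Applying the same two functions to the full 1404 bp sequence gives a value that can be compared with the GC content field of the record.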
 
Protein sequence
MALVILPGLN NLIEDISEEK KLPPNIVELA LREALLKGYE KYRKTFYIGV NQDPFDEEYF 
SNFDVGLDLD EEGYRILSSK IIVEEVESED HQISLVEVKQ VADDAQIGDT VVLDVTPEKE
DFGRMAASTT KQVLAQKLRD QQRKMIQEEF ADLEDPVLTA RVIRFERQSV IMGVSSGIGR
PEVEAELPKR DQLPNDNYRA NATFKVFLKE VSEIARKGPQ LFVSRANAGL VVYLFENEVP
EIQEGTVKIV AVSREANPPS RAVGPRTKVA VDSVEEEVDP VGACIGARGA RIQQVVNELR
GEKIDVIKWS SNPIQYILNS LSPAKVDQVR LVDPAGQHAH VLVPPDQLSL AIGREGQNVR
LAARLTGWKI DVKNSHEYDQ EAEDAAVSEL IIQREDEENL QREAELRLEA EQAERAAEDA
RLRELYPLPE DEEEYGEEQY EGVEFTDNDP LETVQDTETS AKEEKKR
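
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is an illustration under stated assumptions: it builds the codon table from the standard NCBI amino-acid string, which translation table 11 shares with table 1 (the two tables differ only in which codons may act as start codons), and it ignores alternative start codons because this gene begins with ATG. The function name `translate` is hypothetical.

```python
from itertools import product

# NCBI codon ordering: bases in the order T, C, A, G, first position varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw_dna: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(raw_dna.split()).upper()  # drop spaces and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and last three blocks of the gene sequence, as displayed.
print(translate("ATGGCATTAG TTATTCTCCC AGGTTTAAAC"))  # MALVILPGLN
print(translate("GCCAAAGAGG AGAAAAAACG GTGA"))        # AKEEKKR
```

Both fragments match the corresponding ends of the listed protein sequence, and the final TGA codon is the stop codon that is not counted in the 467 aa protein length.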