Gene Ava_1870 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_1870 
SymbolnusA 
ID3681822 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp2322023 
End bp2323300 
Gene Length1278 bp 
Protein Length425 aa 
Translation table11 
GC content46% 
IMG OID637717211 
Producttranscription elongation factor NusA 
Protein accessionYP_322387 
Protein GI75908091 
COG category[K] Transcription 
COG ID[COG0195] Transcription elongation factor 
TIGRFAM ID[TIGR01953] transcription termination factor NusA 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.0338144 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAATGG TTAGTTTACC AGGATTAAAA GATTTAATAG AAAGTATTAG TCGTGAACGG 
AATTTGCCCC GGTTAGCGGT TCAAGCTGCT ATTAGAGAAG CCTTACTTAA GGGTTATGAG
CGTTATCGTC GCGCTCAAAA CATCGAGCGC AAACAGTTTG ATGAAGATTA TTTTGATAAC
TTTGAAGTCG AACTAGATAT TGACGAGGAA GGATTTCGCG TTCTTTCCAC AAAATCCATT
GTTGAAGAAG TTAATAATTC TGACCATCAA ATCTCCTTAG ATGAAGTACA GCAAGTAGCT
CCCGAAGCCC AATTGGGAGA CTCAGTGGTA CTAGATGTCA CCCCAGACCA AGGTGAATTT
GGGCGCATGG CTGCCATGCA AACCAAGCAA GTACTGGCGC AAAAACTGCG GGATCAACAG
CGCCAGATGG TGCAAGAAGA GTTCCAAGAC CTAGAAAGCA CAGTTCTCCA AGCCAGAGTT
TTGCGATTTG AGCGCCAATC AGTCATTCTG GCAGTTAGCA GTACCTTTGG ACAACCAGAA
GTAGAAGCCG AATTGCCCAA ACGCGAACAA TTGCCCAACG ACAATTATCG GGCAAATGCC
ACATTTAAGG TTTATCTCAA AAAGGTTTCC CAAGGTCAGC AACGCGGCCC ACAGTTATTA
GTCTCCCGTG CTGATGCAGG TTTAGTAGTT TATCTATTTG CCAACGAAGT ACCAGAAATT
GAAGACGAAG TGGTACGGAT AGTTGCCGTA GCCAGGGAGG CAAACCCCCC TTCCCGCTAT
GTAGGCCCAA GGACTAAAAT AGCAGTAGAT ACCCTGGATC GTGATGTAGA CCCCGTAGGG
GCTTGTATTG GTGCTAGGGG ATCACGCATT CAGGTAGTAG TCAACGAATT ACGCGGCGAA
AAAATTGACG TGATTCGCTG GTCTCCAGAC CCAGCAACAT ACATTGCTAA TGCCCTCAGT
CCGGCGCGAG TCGATGAAGT GCGCCTCATG GACCCAGAAA CTAGACAAAC TCACGTATTA
GTTGCGGAAG ACCAACTGAG TTTGGCTATC GGCAAAGAAG GACAAAACGT GCGATTAGCT
GCCCGATTGA CTGGCTGGAA AATAGACATA AAAGATAAAG CCAAGTATGA CCAAGCAGCC
GAAGATGCTA AATTTGTGGC GGCGCGTGCA AAATATCAAC TAGAGGAAGA TGACATCGAA
TCAGAGGAGC TAGACTATGA AGAAAATCAA GAAGGAGAAT TAGAAGACGA GTCTTTTGAC
CCCAACGATG AAGAGTAA
 
Protein sequence
MSMVSLPGLK DLIESISRER NLPRLAVQAA IREALLKGYE RYRRAQNIER KQFDEDYFDN 
FEVELDIDEE GFRVLSTKSI VEEVNNSDHQ ISLDEVQQVA PEAQLGDSVV LDVTPDQGEF
GRMAAMQTKQ VLAQKLRDQQ RQMVQEEFQD LESTVLQARV LRFERQSVIL AVSSTFGQPE
VEAELPKREQ LPNDNYRANA TFKVYLKKVS QGQQRGPQLL VSRADAGLVV YLFANEVPEI
EDEVVRIVAV AREANPPSRY VGPRTKIAVD TLDRDVDPVG ACIGARGSRI QVVVNELRGE
KIDVIRWSPD PATYIANALS PARVDEVRLM DPETRQTHVL VAEDQLSLAI GKEGQNVRLA
ARLTGWKIDI KDKAKYDQAA EDAKFVAARA KYQLEEDDIE SEELDYEENQ EGELEDESFD
PNDEE