Gene NSE_0521 details

Gene Information

Locus tag: NSE_0521
Symbol: nusA
ID: 3931612
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Neorickettsia sennetsu str. Miyayama
Kingdom: Bacteria
Replicon accession: NC_007798
Strand:
Start bp: 444417
End bp: 446030
Gene Length: 1614 bp
Protein Length: 537 aa
Translation table: 11
GC content: 42%
IMG OID: 637900677
Product: N utilization substance protein A
Protein accession: YP_506405
Protein GI: 88608874
COG category: [K] Transcription
COG ID: [COG0195] Transcription elongation factor
TIGRFAM ID: [TIGR01953] transcription termination factor NusA
            [TIGR01954] transcription termination factor NusA, C-terminal duplication
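
The coordinates and lengths listed above are consistent with one another, which can be checked with a little arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates (the record does not state its convention) and a CDS whose final codon is a stop codon that is not counted in the protein length. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(444417, 446030)
print(length)                  # 1614
print(protein_length(length))  # 537
```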


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCTTTTT CCAATAATGG ACTCAGTAAT CTCCAAATTG TGGAATCCAT AAATTCTGTC 
GCAGAAAAAG AGGGACTTAG TCCTGATACT CTGTTTCGCG CGATAGGAAT CGAGTTAGCA
CATGAAATAG GAAAAAGGCA GTATGGAGAC CATAGAATTT TTGTTGAGAT AGACAAAAAA
AGTGGTGAGA TACTTGTATC AAAGCGACTT CTTGTAGTCG AAGATTCTGA TAAAGCTCGT
ATGCTTGAGC AGATGGAAGT TACTACCGAA GAGTCCGACG ATTTGCCTTC CTCATTTCGT
GCGGAGGAAA AGGTTCATTA TGATGGTGTA ATCGACTTGT CAACTGCAAG GCTCAAGTAC
CCGAGTATTG AGCACAAAGC GGGCGATATA ATCACAGAAC ATTTACCTTC ATTTTCTTCT
GGATACATAA TTGCCAGAGT CATGAAAGCG AAGCTTGAGC GCCTCATTAC CTCATTAGTG
AGGGAAAAAC AGTATCACTG TTACAAAGGC AGAGTTGGTG AAATTGTTAC AGGCATCGTG
AAAAAATCCA TTGATTTTAA GACGGGTAGC CGAAGCATTA TCGTCGATAT CGCAGGAGTG
GAGGGCCTTT TACCTTATTC ATCACTAGTA AAAGGGGAGA GCTTTAGGCC TGGGGAAAGA
GTCAAGTGTG TTATACAAAA AGTTGAGTAT TCAGTTGTCA AACCTCAAAT CTTGCTCTCT
AGGTCTAGTG GCAGTTTTGT CGCTCAGCTC TTTTCTCAGC AGGTGCCAGA AATATATGAT
CGTGTAGTAG AAATAAGGAA AGTCGCCAGA GATGCTGGCT CCAGAAGCAA GGTCGCTGTT
TTTTCATCGG ATAGAAACAT AGATCCAGTT GGAGCATGTA TTGGCATGGG TGGAAGTAGA
ATAAACGCCG TAGTAAACGA GTTGCACGGT GAAAAGATAG ATATAGTCGA ATATTCGAAC
GATACTGCCA CCTTCCTAGC CAATGCCCTT AAACCAATCA GACCAGTTAA GATTACTGTA
AACGAAGAGA CAAAAAAAAT AGAACTAGTT GTCCCTGATG AAAGTGTCAG CCTTGTTATA
GGACGCGGTG GGCAGAATGT ATACTTGTTA TCCTCTCTCC TTGGATATCG TGTAGAGGTT
CTCAGTGATG CCGAATTTTC CAAAAAGAAA ATGGAAGAAT TCATTTCGGG GACCGCACGC
TTTGTAGAAG CCCTGAATGT CGAGGAAGTA ATAGCTCAGC TATTGGTTAC TGAAGGATTC
TCAACGGTTG AAGAAATCGC TGACTGTAAT ACGTCAAGGC TGGCCTTCAT TGAGGGGTTT
GATAAAGATA TCGCTGAAGA GATAAGGAGT AGGGCGGTTG AATATGTGAA TGAGCAACCT
AAGAGAGTAC GGGCTCTGGC AGAAAAGTAT AAAGCAAATC CGAATATGCT TGCCCTTTCA
AGCTTTGACA CGGGATTACT TGAGGTACTT TTCTCATCTG GACTGACGGA TCTTGAAAAG
GTTGCCGAGT TGTCTTGTGA TGAATTAAGA GAGGTCATTG GAGATAATGG ATTTGGTGTG
CCGTTGCTAG AACAGCTTAT CATCAGATCA AGGAAGACTC TGGGTTGGTT ATAG
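
The gene sequence above is printed in blocks of ten bases. As a minimal sketch, such a listing could be normalized and its GC content computed as follows, here using only the first listed line as input. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the exact definition behind the record's GC figure is not stated in the source.

```python
def clean_sequence(text: str) -> str:
    """Drop all whitespace from a blocked sequence listing and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence as listed in the record.
first_line = "ATGTCTTTTT CCAATAATGG ACTCAGTAAT CTCCAAATTG TGGAATCCAT AAATTCTGTC"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(f"GC fraction of this fragment: {gc_fraction(seq):.0%}")
```

Passing the full listing to `clean_sequence` instead of one line would give the value for the whole gene.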
 
Protein sequence
MSFSNNGLSN LQIVESINSV AEKEGLSPDT LFRAIGIELA HEIGKRQYGD HRIFVEIDKK 
SGEILVSKRL LVVEDSDKAR MLEQMEVTTE ESDDLPSSFR AEEKVHYDGV IDLSTARLKY
PSIEHKAGDI ITEHLPSFSS GYIIARVMKA KLERLITSLV REKQYHCYKG RVGEIVTGIV
KKSIDFKTGS RSIIVDIAGV EGLLPYSSLV KGESFRPGER VKCVIQKVEY SVVKPQILLS
RSSGSFVAQL FSQQVPEIYD RVVEIRKVAR DAGSRSKVAV FSSDRNIDPV GACIGMGGSR
INAVVNELHG EKIDIVEYSN DTATFLANAL KPIRPVKITV NEETKKIELV VPDESVSLVI
GRGGQNVYLL SSLLGYRVEV LSDAEFSKKK MEEFISGTAR FVEALNVEEV IAQLLVTEGF
STVEEIADCN TSRLAFIEGF DKDIAEEIRS RAVEYVNEQP KRVRALAEKY KANPNMLALS
SFDTGLLEVL FSSGLTDLEK VAELSCDELR EVIGDNGFGV PLLEQLIIRS RKTLGWL
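
The protein sequence corresponds to the gene sequence read codon by codon. The following is a minimal sketch, assuming the standard amino-acid assignments (translation table 11 shares these with the standard code and differs in its permitted start codons) and translating only the first 30 bases of the gene sequence. `translate` is a hypothetical helper, not part of any tool named in this record.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order: third base cycles fastest through T, C, A, G.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA sequence up to the first stop codon.

    Whitespace is ignored; a trailing partial codon is dropped.
    Characters other than A, C, G, T raise KeyError.
    """
    seq = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)


# First three ten-base blocks of the gene sequence listed above.
print(translate("ATGTCTTTTT CCAATAATGG ACTCAGTAAT"))  # MSFSNNGLSN
```

The printed residues match the first ten residues of the protein sequence listed above.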