Gene Daud_0923 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaud_0923 
SymbolnusA 
ID6027426 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Desulforudis audaxviator MP104C 
KingdomBacteria 
Replicon accessionNC_010424 
Strand
Start bp984211 
End bp985290 
Gene Length1080 bp 
Protein Length359 aa 
Translation table11 
GC content59% 
IMG OID641593735 
Producttranscription elongation factor NusA 
Protein accessionYP_001717068 
Protein GI169831086 
COG category[K] Transcription 
COG ID[COG0195] Transcription elongation factor 
TIGRFAM ID[TIGR01953] transcription termination factor NusA 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000194155 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATAGCG AGTTTCTGGC GGCTTTGCGC GACCTGGAGA AAGAGCGCAG CATCAGTGTG 
GAAGTCCTGC TTGAAGCCAT CGAGGCGGCG CTTTTGTCCG CCTACCGGCG CAATTTCGGG
ACTTCACACA ACGCGCGGGT GCAGGTTGAC CGCCACACCG GAGACTGCAA GGTGTACGCC
AGGCGGACCG TTGTCCAGGA AGTGGAGGAC CCCCAGGATC AGATTTCCCT TGAGGAGGCC
AGAGCCATTA ACCCGGGCTA CCAACTGGAG GACACGGTAG AATCGGAAAT CACGCCGCGC
AATTTTGGAC GTATCGCCGC CCAAACGGCA AAACAGGTGG TGGTGCAGCG GATTCGGGAA
GCAGAGCGAA ACATGGTCTT CGAGGAATTT GCCAGCCGCG AGGGCGACAT CGTCACCGGT
GTGGTGCAGC GCATCGAGCA GCGCAACGTG TACATCGAAC TGGGCAAAAC CGAAGCCGTG
CTTCCCCCGG CGGAACAGAT ACCGAGAGAG AACTACCGGC CCGGTCAACG GCTGAAGACA
TATATTGTCG AAGTGAAGAA GACCACCAAG GGGCCGTTGA TCCTGGTTTC CCGGACTCAT
CCGGGTCTCC TGAAGCGGCT GTTTGAGATT GAGGTTCCCG AGTTGCACCA GGGGCTGGTG
GAACTGAAAG CGGTGGCGCG GGAGGCTGGG ATCCGGTCCA AGATCGCCGT CTATTCGAAT
GATGAGGGCA TTGATCCGGT CGGGGCTTGT GTCGGCCCGA AGGGCGCCCG GGTACAGGCG
ATCGTTCAGG AGTTGAACGG CGAGAAAATC GATGTCGTGA AGTGGAGCCC CGACTCCTCG
AAGTTTGTGT CCAGTTCCCT GAGCCCGGCC AAGGTGATTG CGGTGGAGGT GTGGGAGGAC
GAAAAGATCG CCCGGGTGAT CGTACCCGAC TACCAACTGT CGCTGGCCAT TGGGAAGGAA
GGTCAAAACG CCCGCCTGGC CGCCAAGCTG ACCGGTTGGA AAATCGACAT CAAGAGCGAA
TCGCAGATGG CTGAAATTTA CCGGGAATAT CTTGAGCAGC AGGGCTACGA GCAGGTGTGA
 
Protein sequence
MNSEFLAALR DLEKERSISV EVLLEAIEAA LLSAYRRNFG TSHNARVQVD RHTGDCKVYA 
RRTVVQEVED PQDQISLEEA RAINPGYQLE DTVESEITPR NFGRIAAQTA KQVVVQRIRE
AERNMVFEEF ASREGDIVTG VVQRIEQRNV YIELGKTEAV LPPAEQIPRE NYRPGQRLKT
YIVEVKKTTK GPLILVSRTH PGLLKRLFEI EVPELHQGLV ELKAVAREAG IRSKIAVYSN
DEGIDPVGAC VGPKGARVQA IVQELNGEKI DVVKWSPDSS KFVSSSLSPA KVIAVEVWED
EKIARVIVPD YQLSLAIGKE GQNARLAAKL TGWKIDIKSE SQMAEIYREY LEQQGYEQV