Gene Dole_3036 details


Gene Information

Locus tag: Dole_3036
Symbol: nusA
ID: 5695895
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfococcus oleovorans Hxd3
Kingdom: Bacteria
Replicon accession: NC_009943
Strand:
Start bp: 3642574
End bp: 3644001
Gene Length: 1428 bp
Protein Length: 475 aa
Translation table: 11
GC content: 58%
IMG OID: 641265652
Product: transcription elongation factor NusA
Protein accession: YP_001530916
Protein GI: 158523046
COG category: [K] Transcription
COG ID: [COG0195] Transcription elongation factor
TIGRFAM ID: [TIGR01953] transcription termination factor NusA
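
The length fields in this record agree with the coordinates if Start bp and End bp are read as 1-based inclusive positions, which is an assumption here since the record does not state its convention. The sketch below checks that arithmetic, and also that the protein length equals the codon count minus one stop codon. The helper names gene_length and protein_length are hypothetical and exist only for this illustration.

```python
# Values copied from the record above.
START_BP = 3642574
END_BP = 3644001


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Amino acids encoded by a CDS whose final codon is a stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints: 1428 475
```

Both results match the Gene Length and Protein Length fields listed in the record.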


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00201036
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATCATAC AGGACGTGAA ACGTGTTGTA GAACAGGTCA GCCGGGACAA GGGCATTGAC 
AGGGACACTC TTGTGAAGGC CCTTGAAGAG GCCATCAAGT CTGCGGCTCG GAAGCGGTAT
GGCGCGGCTA TCGATATCGA AACCATGTAT GATGAGGATA CCGGCGAAAT TGAGATATTC
CAGTTCAAGG AGGTGGTTGA AACCGTCCAG GACCCGGATC TGCAGATCAC GTTTGTGGAG
GGCAGGGCCC TGGATCCGGA GTGTGAACTG GGAGACAGCC TCGGTGTGAA GATGGACACG
CAGTCCTTTG GCCGTATCGC GGCCCAGTCG GCCAAGCAGG TTATTATTCA GAAGATGCGG
GAGGCGGAGC GAAGCGCGGT CTACAAGAAT TTCGTGGAAA AAGAGGGTGA GATCATCAAC
GGTATCGTCT CCCGCATGGA GCGCGGTAAC GTGATTGTCA ATATCGGCGA GGCCGAGGCG
ATTCTTAACT CCCGGGAACA GATTCCCGGC GAAGGCTACC GCCGGGGGGA CAGGGTGCGG
GCCAATGTGA TGAAAGTGCT TGAGGAGACC GCCGGCCCCC AGATTATCCT GTCCCGGGCC
CATCCGGATT TTGTGGTCAA TCTCTTCAAG ACGGAAGTGC CGGAAATCAG CGAAGGTATC
ATTACCATCA AGGCCATTGC CCGGGAGGCC GGCGGGCGGA CCAAGATCGC TGTGGTTTCC
AACGACATGG ATATCGACCC GGTGGGTGCC TGCGTGGGGG TCCGGGGCAA CCGGATTCAG
AACGTGGTCA AGGAGCTCAA AGGGGAAAAG ATCGATATCG TTCCCTGGAA CCCGGACCCG
GCCAAGTTCG TCTGCAACGC GCTTTCTCCG GCAGAGATAG CCCGTGTGAT CATCGACGAG
GACAATGCGG CCATGGAGAT CATCGTGCCC GACGAGTCCC ACTCTCTTGC CATCGGCCGA
AGAGGGCAGA ATGTGCGGCT TGCCTCCAAG CTGACCGGCT GGCACCTTGA TGTACAGAGC
GAGTCCATAT ACACCCAGGC CATGGAACGG GGGTATGACA CGCTTCTTCA GATACCCGGT
GTGGATGGGT CCCTGGCAAA TGCGCTGTGT GAAGTCGGGT TTTTCTCGGC GGATGATATT
TCCGGTGCCG CGGTTGATGA CCTGATTGAA CTGGAAGGCA TTGATGAAGC CTCGGCAAAG
GCGTTGATCC GTGACGCGGT CAAGGTTGCG GAGCAGGCAG CCAGGGAGCA GGCAATCAGG
GAGAAAGCAG CCAAAGAGCA GGCAGCCAAA GAACAGGCAG CCAGGGTGCA GGCGGCAGAA
GAAGCATCTC CGGCGCCAGA TGAAGAAGCG CCGAATAAAG AGGCGCCGGA TAACGACATT
GCGCCAGCCG GAGAGACCCC GGCGGATGAC GGCCATGAGC CGGTATAA
 
Protein sequence
MIIQDVKRVV EQVSRDKGID RDTLVKALEE AIKSAARKRY GAAIDIETMY DEDTGEIEIF 
QFKEVVETVQ DPDLQITFVE GRALDPECEL GDSLGVKMDT QSFGRIAAQS AKQVIIQKMR
EAERSAVYKN FVEKEGEIIN GIVSRMERGN VIVNIGEAEA ILNSREQIPG EGYRRGDRVR
ANVMKVLEET AGPQIILSRA HPDFVVNLFK TEVPEISEGI ITIKAIAREA GGRTKIAVVS
NDMDIDPVGA CVGVRGNRIQ NVVKELKGEK IDIVPWNPDP AKFVCNALSP AEIARVIIDE
DNAAMEIIVP DESHSLAIGR RGQNVRLASK LTGWHLDVQS ESIYTQAMER GYDTLLQIPG
VDGSLANALC EVGFFSADDI SGAAVDDLIE LEGIDEASAK ALIRDAVKVA EQAAREQAIR
EKAAKEQAAK EQAARVQAAE EASPAPDEEA PNKEAPDNDI APAGETPADD GHEPV
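
The sequences above are displayed in space-separated blocks of 10 characters, so they need to be joined before any downstream use. The following is a minimal sketch of that cleanup, plus a simple G+C fraction calculation. The function names clean_sequence and gc_fraction are hypothetical helpers, not part of any database tool, and the example input is only the first displayed line of the gene sequence.

```python
def clean_sequence(text: str) -> str:
    """Join whitespace-separated display blocks into one uppercase string."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First displayed line of the gene sequence, copied from the record.
first_line = "ATGATCATAC AGGACGTGAA ACGTGTTGTA GAACAGGTCA GCCGGGACAA GGGCATTGAC"
seq = clean_sequence(first_line)

print(len(seq))                    # number of bases in this line
print(round(gc_fraction(seq), 2))  # G+C fraction of this line only
```

Running the same two functions over the full gene sequence would give values to compare against the Gene Length and GC content fields of the record.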