Gene Ndas_0002 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_0002 
Symbol 
ID9243828 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp3397 
End bp4536 
Gene Length1140 bp 
Protein Length379 aa 
Translation table11 
GC content71% 
IMG OID 
ProductDNA polymerase III, beta subunit 
Protein accessionYP_003677961 
Protein GI297558987 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000269969 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00612818 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGAAGTTCC GGGTCGAACG CGACGTACTG GCCGAGGCGG TCGCCTGGAC CGCGCGCACA 
CTCCCGACGC GCCCCTCGGT GCCGGTGCTC GTCGGCATCC TGCTGGAGGC CGGTGAGTTC
GACGGCCTCC AGCAGCTGCG CCTGTCCGGC TTCGACTACG AGGTCTCCAC CCAGGCCGCG
GTGGACGTCG AGGTCGAGGA GCCGGGCACG GTCCTGGTCA CCGGTAAGCT CCTGGCCGAG
ATCACCCGCA ACCTCCCCGC GCAGCCCGTG GAGATCTCCA CCGACGGCGC CAAGGTCGTC
GTCACCGGCG GCAGCGCGAA GTTCACCCTG ACCACCATGC CGGTGGAGGA CTACCCCACG
CTCCCGGAGA TGCCCGGTGT GAGCGGGACC GTCGGCAGCG ACGCCTTCGC CGCCGCGGTC
AGCCAGGTGG CCGTGGCCGC CGGGCGCGAC GACACCCTGC CGATGCTCAC CGGCGTGCGC
GTCGAGATCG AGGGCGAGAC CATCACGCTC GCCTCCACCG ACCGCTACCG CCTGGCCGTG
CGCGAGTTCA CCTGGAAGCC GGAGAACCCC GACCTGTCCG CGGTCGCGCT GGTCCCGGCC
AAGACCCTCC ACGACACCGC CAAGTCGCTC ACCTCGGGCG CCGAGGTCTC GATCGCCCTC
TCGGACGGCG GCTCCGGCGA GGGCATGATC GGCTTCGAGG GCGGCGGCCG CCGCACCACG
ACCCGCCTGC TCGACGGCGA GTTCCCCAAG TACCGGGCGC TGCTGCCGGA CACCTTCAAC
TCGGTGGCCG AGGTCAGCCG CTCCGAGTTC GTCGAGGCGG TCAAGCGCGT CTCGCTGGTC
GCCGAACGCA ACACCCCGCT GCGGCTGTCC TTCAGCCAGG GCCAGCTGGT CCTGGAGGCG
GGCACCGGCG AGGAGGCGCA GGCGGTCGAG GTCCTGGAGG CCGACCTGGA CGGCGACGAC
ATCCAGATCG CCTTCAACTC CGGGTTCCTC CTGGACGGGC TCGGCGCCAT CGGCACCGAC
GTGGCCCGCC TGCACTTCAC CACCTCGACC AAGCCGTCGA TCCTGACCGG CAAGCCCGCG
GAGGAGGGTT CCTCCCCCGA GTACCGCTAC CTGATCATGC CGGTGCGTCA GCCGGGCTGA
 
Protein sequence
MKFRVERDVL AEAVAWTART LPTRPSVPVL VGILLEAGEF DGLQQLRLSG FDYEVSTQAA 
VDVEVEEPGT VLVTGKLLAE ITRNLPAQPV EISTDGAKVV VTGGSAKFTL TTMPVEDYPT
LPEMPGVSGT VGSDAFAAAV SQVAVAAGRD DTLPMLTGVR VEIEGETITL ASTDRYRLAV
REFTWKPENP DLSAVALVPA KTLHDTAKSL TSGAEVSIAL SDGGSGEGMI GFEGGGRRTT
TRLLDGEFPK YRALLPDTFN SVAEVSRSEF VEAVKRVSLV AERNTPLRLS FSQGQLVLEA
GTGEEAQAVE VLEADLDGDD IQIAFNSGFL LDGLGAIGTD VARLHFTTST KPSILTGKPA
EEGSSPEYRY LIMPVRQPG