Gene Ndas_5295 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_5295 
Symbol 
ID9249194 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014211 
Strand
Start bp458233 
End bp459387 
Gene Length1155 bp 
Protein Length384 aa 
Translation table11 
GC content72% 
IMG OID 
ProductDNA polymerase III, beta subunit 
Protein accessionYP_003683181 
Protein GI297564208 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAACTTC GAGTGGACCG CGACGCGTTC GCCGAGGCGG TCGCCTGGAC GGCCCGGGCC 
CTCCCCACAC GGCCCGCCGT CCCGGTGCTC TCGGGGATCC GCATGGAGCT CTCCCCGGAC
GGCGCCTCCC TGCACCTGTC GGGGTTCGAC TACGAGGTCT CCACCCGCGC CTCGGTGGAC
GTCCTCTCCG AGGAGCCCGG CGCCGCCCTC GTCCCCGGCC GCCTCCTGGC CGAGATCGTG
CGCAACCTGC CGTCCGGCTC CGTGCACATC GACAGCGACG GCCCCAAGCT GCGCATCGTC
GGCGGCGCCG CGCGCTTCAC CCTCATCACC ATGCCGCTGG AGGACTACCC CACCCTGCCC
GGCATGCCCG GACGCATCGG CTCCGTCGCC GCGGACGCCT TCGCGGCGGC CGTGCGCCAG
GTGGCCCCGG CGGCCAGCCG CGACGACACC CTGCCCATGC TCACCGGCGT CTACCTCGAC
TTCAGCGGCG ACACCCTGAG CCTGGTCGCC ACCGACCGCT ACCGCATCGC CGTGCGCGAA
CTGTGGTGGA GCCCCGAGGA CCAGGGCCTG GACGCGGCCG CCCTGGTGCC CGCGCGCACG
CTGAGCGACA CCGTGCGCGG TCTGCTCACC AAGTCCAACG TGGACATCGC CCTGTCCACG
GCCAGCGGAG GCGAGGGCGT CAGCCTCTCC CCGGGCGAGG GCATGATCGG CTTCGAGAAC
GGCGAGCGCC GCACCACCAC CCGGCTGATC GACAGCGAGT TCGTCAAGTA CGCCGCCTGG
TTCCCCAAGG AGTTCTCCGC GCGCGCCGAG GTGGCCGTCA CGCCCCTGGC GGAGGCGGTC
AAGCGCGTGG CGCTGGTCGC CGACCGCAAC ACCCCGCTGC GGCTGGCCTT CTCCGAGGGC
GAGGTCGTGC TGGAGGCGGG GTCGGGAGAG GACGCCCAGG CCGTGGAGGC GATCGAGGTC
GGCTACGAGG GTGAGCCCCT GCGGCTGGCC TTCCGTCCCG ACTACCTCAT GGACGGGCTG
GCGGGTGTGG AGACCGACAC CGCCTACCTC AACTTCACCG AGCCGACCAA ACCCGCCGTG
TTCACCGACG TGCCGGCCAA GGAGGGGGAG AACCCCTCCT TCCGTTACCT GGTTCAGCCT
TTGCGGGTGT CCTGA
 
Protein sequence
MKLRVDRDAF AEAVAWTARA LPTRPAVPVL SGIRMELSPD GASLHLSGFD YEVSTRASVD 
VLSEEPGAAL VPGRLLAEIV RNLPSGSVHI DSDGPKLRIV GGAARFTLIT MPLEDYPTLP
GMPGRIGSVA ADAFAAAVRQ VAPAASRDDT LPMLTGVYLD FSGDTLSLVA TDRYRIAVRE
LWWSPEDQGL DAAALVPART LSDTVRGLLT KSNVDIALST ASGGEGVSLS PGEGMIGFEN
GERRTTTRLI DSEFVKYAAW FPKEFSARAE VAVTPLAEAV KRVALVADRN TPLRLAFSEG
EVVLEAGSGE DAQAVEAIEV GYEGEPLRLA FRPDYLMDGL AGVETDTAYL NFTEPTKPAV
FTDVPAKEGE NPSFRYLVQP LRVS