Gene Ndas_3306 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3306 
Symbol 
ID9247168 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp3947832 
End bp3948974 
Gene Length1143 bp 
Protein Length380 aa 
Translation table11 
GC content72% 
IMG OID 
ProductPyridoxal-5'-phosphate-dependent protein beta subunit 
Protein accessionYP_003681218 
Protein GI297562244 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCATCCT CACCGACAGG TCCGCGGCCG GGCGACGACA CCCCGTGGGT GGACCGCTGC 
GACGACAGCG CCCGCTCCTG GGCGGACGAG GCGGTACGCA GGGTCGTCGC CGACGCCAAC
CGCTCGGCCG ACACCCACCT GCACGTGTTC CCGCTCCCCC CGGAGTGGGG CATCGACCTG
TACCTCAAGG ACGAGTCCGT GCACCCCACG GGCAGCCTCA AGCACCGGCT GGCGCGCTCG
CTGTTCCTGT ACGGGCTGGC CAACGGGTGG ATCCGGGAGG GCACCACCAT CGTGGAGGCC
AGCTCGGGGT CCACCGCGGT GTCGGAGGCC TACTTCGCCC AGCTGGTCGG CCTGGACTTC
ATCACGGTCA TCCCGCGCAG GACCAGCCCG GAGAAGATCG CCCTGATCGA GCGCTACGGC
GGCAGGTGCC ACTTCGTGGA CGCCCCGCCC GCGATGTACG CCGAGGCCGA GCGGCTGGCC
GCCGAGACCG GCGGGCACTA CATGGACCAG TTCACCTACG CCGAGCGGGC CACGGACTGG
CGCGGCAACA ACAACATCGC CGAGTCGATC TTCGAGCAGC TGGCCATGGA GCGCCACCCG
TGCCCGGAGT GGATCGTGGT GGGCGCGGGC ACCGGCGGCA CCTCCGCCAC CATCGGCCGC
TACCTGCGCT ACCGGCGCCT GGGCACCCGG CTGGCGGTGG TGGACCCGGA GAACTCGGCG
TTCTTCCCCG GCTGGGTCAC CGGCGCGTCC GACTACGCCA CCGGGATGCC CTCCCGGATC
GAGGGGATCG GCCGCCCCCG CATGGAGCCG AGCTTCGTGC CGTCGGTGAT CGACCTGATG
ATGCCGGTGC CCGACGCCGC GAGCATCGCC GCGATGCGCC ACCTGCACGA CCGCACCGGG
CTCTCCGCCG GAGGCTCCAC GGGCACCAAC CTGTGGGGGG TGTGGCACCT GGTGGCCAGG
ATGCTCCGGG AAGGGCGCAG GGGCAGCGTG GTCACCCTGA TCTGCGACGG CGGCGAACGC
TACCAGCACA GCTACTACAA CGACGCCTGG GTGGCGGAGC GGGGCCTGGA TCCGGCGCCC
TACCGCGCGA CCATCGACCG GTTCCTCGCG GAGGGGGTCT GGGAGCCTCC CCAGACCCCC
TGA
 
Protein sequence
MASSPTGPRP GDDTPWVDRC DDSARSWADE AVRRVVADAN RSADTHLHVF PLPPEWGIDL 
YLKDESVHPT GSLKHRLARS LFLYGLANGW IREGTTIVEA SSGSTAVSEA YFAQLVGLDF
ITVIPRRTSP EKIALIERYG GRCHFVDAPP AMYAEAERLA AETGGHYMDQ FTYAERATDW
RGNNNIAESI FEQLAMERHP CPEWIVVGAG TGGTSATIGR YLRYRRLGTR LAVVDPENSA
FFPGWVTGAS DYATGMPSRI EGIGRPRMEP SFVPSVIDLM MPVPDAASIA AMRHLHDRTG
LSAGGSTGTN LWGVWHLVAR MLREGRRGSV VTLICDGGER YQHSYYNDAW VAERGLDPAP
YRATIDRFLA EGVWEPPQTP