Gene Ndas_4526 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_4526 
Symbol 
ID9248406 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp5367404 
End bp5368708 
Gene Length1305 bp 
Protein Length434 aa 
Translation table11 
GC content69% 
IMG OID 
ProductErfK/YbiS/YcfS/YnhG family protein 
Protein accessionYP_003682419 
Protein GI297563445 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.442716 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACGGGCA AGACACCTCC GGCGGTGGCG CGTCGTTTCG GTATCGGACT GGCAGCGCTG 
GCGCTGGCCG CCACGGCCTG CACGTCGGGT GAGGCCGAAA CCCAGGGCGG CGGTACCGAC
GCGGCCAGCG CCGATCCCGC GGAGCTGGTG ATCACTCCCG AGAACGGTGC CGAGGAGGTC
GCGCCCAACT CCCCCATCCG GGTCACCGCG GAGAGCGGTG TCATCACGGA CGTCCAGGTG
GAGCAGGTCG TCGCCACCGA GGCCGCGGCG GAGGGCGAGG GCGAGGCGCA GGAGGCCGAC
CTCTACGCCA TGACCGGCAC CCTCAACGGT GACGCGACCG AGTGGGTCAG CGACTGGAAC
CTGCGCCCCG GCGCGGAGGT CGTGGTCACC GCGACCGCCG AGAACGACGC CGGTGAGGAG
ACCGAGGTCG TCCAGGAGTT CACCACCCTG GAGGCGGTCG CCGGACAGCG CCTGGAACTG
GCCTCCAACT GGCCCGTCTC CGGTGACACC GTCGGCGTGG GCATGCCGAT CGTCATCAAC
TTCGACCTGC CGGTGACCAA CAAGGCGCAG GTCGAGAACT CCATGGAGGT CATCTCCGAG
CAGGGCGTCC AGGGCGCCTG GAACTGGGAG ACCGACACGA TGGCCGTGTT CCGGCCCGAG
GAGTACTGGG AGCCCCACCA GTCCGTGAGC GTCGACCTGC GCCTCGCCGG CGTCGAGGCC
TCCGAGGGCG TCTACGGCGT GGAGAACCAC CGCATCGACT TCGAGGTCGG CCGCGAGCTG
ATCATGACCA TGCACGTGCC CGACCACGAG CTGGTCGTCA ACATCGACGG TGAGCACGAC
CGCACCATCG AGGTGAGCAA CGGCAAGGCC AGCCGCCGCT TCGACACCAC GACCTCCGGC
ACCCACGTGC TCATGCAGCG CTACGAGCAG ATGACCATGG ACTCCTCCAC CGTGGGCATC
CCCGAGGGCA CCCCCGGCGC CTACAACGTG GACGTGCAGT ACGCGGTCCG CACCAGTGAC
AGCGGCGAGT TCCTGCACGA GGCCTCCTAC AACGGCAACA TCGGCAGCGC CAACACCTCC
AACGGCTGCA CCAACCTGCG CATGGACGAC GCCCGCTGGA TCTTCGAGAA CACCCTCATG
GGCGACGTCC TGGAGACCAC CGGTACCGAC CGCGAGCTGG AGTGGAACAA CGGCTGGGGC
TTCTGGCAGA TGTCCTGGGA CGAGTGGCTG GCCGAGAGCG CGACCGGCGA GCCGCAGGTG
ACCGACGGTT CGGGCACCCC CGGCTCCGTC CACGGCGAGC AGTAA
 
Protein sequence
MTGKTPPAVA RRFGIGLAAL ALAATACTSG EAETQGGGTD AASADPAELV ITPENGAEEV 
APNSPIRVTA ESGVITDVQV EQVVATEAAA EGEGEAQEAD LYAMTGTLNG DATEWVSDWN
LRPGAEVVVT ATAENDAGEE TEVVQEFTTL EAVAGQRLEL ASNWPVSGDT VGVGMPIVIN
FDLPVTNKAQ VENSMEVISE QGVQGAWNWE TDTMAVFRPE EYWEPHQSVS VDLRLAGVEA
SEGVYGVENH RIDFEVGREL IMTMHVPDHE LVVNIDGEHD RTIEVSNGKA SRRFDTTTSG
THVLMQRYEQ MTMDSSTVGI PEGTPGAYNV DVQYAVRTSD SGEFLHEASY NGNIGSANTS
NGCTNLRMDD ARWIFENTLM GDVLETTGTD RELEWNNGWG FWQMSWDEWL AESATGEPQV
TDGSGTPGSV HGEQ