Gene Ndas_4624 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_4624 
Symbol 
ID9248505 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp5492321 
End bp5493754 
Gene Length1434 bp 
Protein Length477 aa 
Translation table11 
GC content77% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003682516 
Protein GI297563542 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000471714 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.246697 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAGCCACG ACGCGGTCGC CCTTCTGGCC AGCCCTCCCG ACCGGCGCGC CCTGACCAAG 
GCCCTGGTCG CCGCCGGACC CGACCTCCGG GTGCGGTTCG CGGCCGACGG CGCCGTGGTC
GAACTGCTCG ACGCCGGGGG CCGCCTGGTC GCCGCCGTGC AGGCGGCCCA GCGCCTGGCC
CTGTCCGCCG AGGCCGAACG GCTCCTGTCC GACGGCATGG TCGACGACCT GCCCGCCCAG
CCCTACTGGG TGGAGGCGCG CGGCGCCGAA CTCGCGGACA CCGACACCGC CGGGGCCGTC
GGCCGCTTCG TCCGCGACCT CGCGGACCGG CTCGGCGGCG TCGTGTGGGA GCCCGAGCCG
CGGCTCTCCC GCGGCGACGC CTTCCTGGAC GGCTCCACCG ACCACCCCGC CGTCACCGCC
CGCACCGACA GGGCCGTGGT CGTGGTGCAG GACCGCCCCC TGGTCCCCAT GTCCCCCTGG
CTCGTGGACA CCGTCGCCGC CCACGGGCGC GAGGGCCTGC GCCTCCAGGT CGTCACCCCC
TCCACCAGCC GCCTCACCCA CGCACTGCGC TCGGTGCTGG CCGACCCCAC CGCCCGCTGG
GTGGTCCAGG CCCCCGACGG CGCCTACTAC GACGGGTTCT CCGGGGTGCC CCTGGTCTGG
GACGAACGGG AGGCCTTCGT CCTGGACCGG AGCGCCCGGG CCGAGGACGG ACCGCACGAG
GCGTTCCGCG CCCGGGCCGA GGATGTGGAG GGGACCGGCT CCCACCTGCT CGTCGAGCTG
AAGGCGGAGC ACCCCGCCGA CAACGGCCTG GTCCTGGGCG AGGCCGCCGA ACTGCTCGCC
GAGCGCCTGG GCGGCCGCGC CCCCGCGCTC TGGGGCACCA GCGAGCCCCT CCCCCAGGAG
TGGAACCGGG CGGCGCTGAC CCGGCTGTGC CGCGAACGCG CGCCCGGGCA GACCTGGTTC
GTGTTCACCG GCCCTCCCGA GGGCGTGCGC GAGGAGGGCG TGCTCCCCTT CTGCGGCACC
CAGCGGGTGA TGCGCACCGC GCACGGGGTG CGTGAGAGCG TCTCGTTCGC GGTGGCCCGG
CCCGCGGGCG AGGAGCACGA CCTGGACGCG TTGTCGTCGG TGGTCCGTAC ACTCACCGAA
CGCGATGTGC TGCGGACCAT GACGGTGCGG CGCGCGGCCG GGCGGCCGGA CCTGACCCAC
GAGCCCCGCT GGTGCGGCCT CCCCCTGCCG GTCGGCCTGG CCGTGGGGGT GGAGGGCGTC
TCCTCGATCG GCACCGACCG GGCGCTGTCC GCTCCGGTGC GCGGGGTGCC GTTCGGCCCG
CCGCTCACGC CCTCGGTCTG GTACCGGGTC GGGGACGGCA CCGAGCCGGA CGGCTGGCAG
CGCTTCCGCG AGCTCATGGA CCACCTGCAC CCCGACGGGG CCCGCGCGGG CTGA
 
Protein sequence
MSHDAVALLA SPPDRRALTK ALVAAGPDLR VRFAADGAVV ELLDAGGRLV AAVQAAQRLA 
LSAEAERLLS DGMVDDLPAQ PYWVEARGAE LADTDTAGAV GRFVRDLADR LGGVVWEPEP
RLSRGDAFLD GSTDHPAVTA RTDRAVVVVQ DRPLVPMSPW LVDTVAAHGR EGLRLQVVTP
STSRLTHALR SVLADPTARW VVQAPDGAYY DGFSGVPLVW DEREAFVLDR SARAEDGPHE
AFRARAEDVE GTGSHLLVEL KAEHPADNGL VLGEAAELLA ERLGGRAPAL WGTSEPLPQE
WNRAALTRLC RERAPGQTWF VFTGPPEGVR EEGVLPFCGT QRVMRTAHGV RESVSFAVAR
PAGEEHDLDA LSSVVRTLTE RDVLRTMTVR RAAGRPDLTH EPRWCGLPLP VGLAVGVEGV
SSIGTDRALS APVRGVPFGP PLTPSVWYRV GDGTEPDGWQ RFRELMDHLH PDGARAG