Gene Ndas_3508 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3508 
Symbol 
ID9247377 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp4213543 
End bp4216059 
Gene Length2517 bp 
Protein Length838 aa 
Translation table11 
GC content78% 
IMG OID 
Productprotein of unknown function DUF214 
Protein accessionYP_003681415 
Protein GI297562441 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.40387 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGATGG TTCTGCGCGG CGCCGCCGAC CGCGCGCGCC AACTGGCCCT GTCGGTCCTG 
ACCGTGGCCC TGGGCGCCGG GCTGGCCACG GCCGTCCTCG CCCTCCAGGA CTCGGCCGAG
CGGGTGGCCG CCGGGGGAGC CGGGGCCTCC TGGACGCTGT CACGGGCACC GGTCGTGGTG
ACGGCCGTCC CCGAGGAGGC CGCGGCCGGG ATCACCGCCT CCCCGCTGGG GGAACCGCCC
CGGCTGGACC CGGATACCGT AGCGGAACTG GAGCGCCTCC CCGGTGTGCG CCGGACGGCG
GTCGAGGCGC CCTTCTCCGC CTACGTGGTC ACCGCCGACC GTACGCTCGG CGGCCACTCC
GACCGCTCCT TCGGCCACTC ATGGGCGCTC GCCGAGGCCG AGGGGCTCAC CCCCGCCACC
GGGCGGGCCC CCGAGAGCGT GGGCGAGGTG GTCCTCGACA CCCGCACCGC CGCCGACGCC
GGGCTCACCC CCGGCGACCG TGCGCGGGTC CTGACCTCCG ACGGGACCGC CGACGTGCTG
GTCACCGGAA CCGTCGAGCG CGGCGGCGCC CCGGACCGGG CCCTGTTCTT CCCACCCGTC
GAGGCCGCGC GTCGGGGCGG GGATCCGGTC CTGGCCCTGC TCTGGCCCGG GGAGGGAACC
GGCCCCGACC GGCTGGCCGG GGCGGTCGAG GAGGCCGCGC CCGGCGCCCG GGTGCTCACC
GGGGATGAAC GCTCCACGGC GCTGGCCCTG GACGGGGAGA ACCGGGACCT GGCCTCGGGC
ATGGGCCGGT TCCTGGGCAC CATGGCCGCG CTGGCGCTGG CGGTGGCCGC CGTCACGGTC
GCGGGCCTGC TCTCCCTGAC CGTGCGCGAC CGCGCCCGGG AGTTCGCCCT CCTGCGCCTG
GCCGGGGCCC GGCCGGGGCT GGTGCGCCGA CTCGTCGTCG GGGAGGCCCT GGTCCTGGGG
TGCGTGGCCG CCGCGCTCTC CTGCGTCGTG GGTACGGCGC TGGCGCTGCT CCTGAGCAGG
CTCTTCGCGG AGCTGGGAGC GCTGCCGGAC GGGTTCGCGC TGGTCCTGGG CTGGCCCCCG
CTCGCGGCGG GCGCGGCGCT GGCCCTGGCC GTGCCGCTGG CGGCCTCCTG GCGCCCCGCG
CTCACCGCGG GCCGGATCGC GCCCGTGGAG GCGATGCGCG CGGCGCAGGC CGAACCCGTC
TCCTTCTCCC GTGCCCGCCC GGTGCTCGGC ACCGTGGTCC TGTGCGGCGC GGCGGCCCTG
TTCGCGACGG CCTGGGGCCT GGCCGGAACC GTGGTGGCGG TGACCGCCGC CGCCACGGCC
GCGCTGGTCC TCGTCGCCGC CGCGGTCCTG CTCTCCCCGG TGCTGGTCCA CGCGGTGCTG
CTCCTGCTGC GCCCGCTCAC CCGCAGGAGG GCCGCCTCCC TGGTCGCCGA CCGGGAGGCG
CGCGCCGACG TCCGCCGGGT GGCGGGCGTC ATGACGCCCC TGCTGGTGAC CACGGCGGTC
GCCTGCCTGC TGCTCTTCCA GGAGACCACC ACCACCGAGG CCCGCCTTCG CGCCTACGGG
GAGCGCCTGG CCGCCGACCT GGTGGTGTCC GGGGCCCTGG GCGTGGGCCT GCCCGCGTCC
GCGGCCGAGG CCGCCGAGGG CGTGCCCGGG GTCGCCGCCG CCGGGGGCTA CCGCCAGACG
GTCACCTCCG CGGGCGGGCC GTACCTGACC ACCCACCTGG TCGAACCCGA GACGGTGCCG
CGGATCTACG ACCTGGCGGT GGAGGGTGGC GCGTGGGAGG ACTTCGGCAC CGGCGGCGTC
GCCGTGCGCG CCGACACCGC CCGGAGCCGG GGGTGGCGCG CGGGGCGGAC CGTGGAACTG
CTCGGCCCCG ACGGTACCGG GTTCACGGCC CGGGTGTCGG TCCTGTACCG GGCGGGGCTC
GACTTCCCCG ACGTCCTGCT GCCCGAGGAG GCCGTCGCGC CCCGGATGCT CGACACCCTG
CACAACGGCC TGTACGTGGT GCTCGACCCC TCGGCCGACG CCGGGAGGAC GGCCTCCCTG
CTGGAGGAGG CGATCGACGC CGGGCCCGAA CTCCGGGTCA GCGACCGCGC CGGACACATC
GCGGACCAGG CCCGGCTCGG CCAGGAGGAC GCGTGGATCA CCCACCTCAT GGTGGCCCTG
GTGGCGGGCT TCGCCGGGGT GAGCGCGGTC AACGCCCTCG TGGTCTCCGT CTCGGCCCGC
GCGCGGAGCT TCGCCCTGCT GCGGCTGGTG GGGGCCTCGC GGGCCCAGGT CGCCGGGATG
GTCGCGGGGG AGGCCCTGGC GGTGTCGCTG GCGGGGGTGG CGCTGGGCAC CGCCACGGCG
CTGACCGGGG TGGCCGCCGT GGGCCACGCC CTCGTCGGCG GCGGGACGGT GGTGCTCGCC
GTCCCCCTGG ACCAGTACCT GCCGCTGGCC GGTGCCGTCG TGGGCATCGG GCTGCTCGCG
AGCCTGGTCC CGGCCGTCGC GGCCCTGCGC GCCCGCCCGC TGCACGCCGC CGGGTGA
 
Protein sequence
MRMVLRGAAD RARQLALSVL TVALGAGLAT AVLALQDSAE RVAAGGAGAS WTLSRAPVVV 
TAVPEEAAAG ITASPLGEPP RLDPDTVAEL ERLPGVRRTA VEAPFSAYVV TADRTLGGHS
DRSFGHSWAL AEAEGLTPAT GRAPESVGEV VLDTRTAADA GLTPGDRARV LTSDGTADVL
VTGTVERGGA PDRALFFPPV EAARRGGDPV LALLWPGEGT GPDRLAGAVE EAAPGARVLT
GDERSTALAL DGENRDLASG MGRFLGTMAA LALAVAAVTV AGLLSLTVRD RAREFALLRL
AGARPGLVRR LVVGEALVLG CVAAALSCVV GTALALLLSR LFAELGALPD GFALVLGWPP
LAAGAALALA VPLAASWRPA LTAGRIAPVE AMRAAQAEPV SFSRARPVLG TVVLCGAAAL
FATAWGLAGT VVAVTAAATA ALVLVAAAVL LSPVLVHAVL LLLRPLTRRR AASLVADREA
RADVRRVAGV MTPLLVTTAV ACLLLFQETT TTEARLRAYG ERLAADLVVS GALGVGLPAS
AAEAAEGVPG VAAAGGYRQT VTSAGGPYLT THLVEPETVP RIYDLAVEGG AWEDFGTGGV
AVRADTARSR GWRAGRTVEL LGPDGTGFTA RVSVLYRAGL DFPDVLLPEE AVAPRMLDTL
HNGLYVVLDP SADAGRTASL LEEAIDAGPE LRVSDRAGHI ADQARLGQED AWITHLMVAL
VAGFAGVSAV NALVVSVSAR ARSFALLRLV GASRAQVAGM VAGEALAVSL AGVALGTATA
LTGVAAVGHA LVGGGTVVLA VPLDQYLPLA GAVVGIGLLA SLVPAVAALR ARPLHAAG