Gene Ndas_2729 details

Gene Information

Locus tag: Ndas_2729
Symbol:
ID: 9246580
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 3264638
End bp: 3266551
Gene Length: 1914 bp
Protein Length: 637 aa
Translation table: 11
GC content: 72%
IMG OID:
Product: Protein of unknown function DUF1998
Protein accession: YP_003680648
Protein GI: 297561674
COG category:
COG ID:
TIGRFAM ID:
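
The coordinate and length fields in this record are tied together by simple arithmetic. Assuming 1-based, inclusive coordinates (an assumption about the record's convention, although it agrees with the numbers given), the gene length is End bp minus Start bp plus one, and the protein length is the number of codons minus the terminal stop codon. A minimal Python sketch of that check, with hypothetical function names:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in a single stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon encodes no residue


print(gene_length(3264638, 3266551))  # 1914
print(protein_length(1914))           # 637
```

Both results match the Gene Length and Protein Length fields of the record.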


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0213066
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.652268
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCACAAGG AGACGCACCG GCGCAGGGTC GGCTCGGTCC GGCCCAGCCA CCTGATGTTC 
ACCAGCGGCG TGGGGTCCCT GGTGGACCTG CCCAACTTCG CGGTGCTGGT GCGCGGGCTG
GACGAGTGGA ACCACACCCA CGCCTACGGC TGGGAGCCCA TCGTCGAGCC CCGGCTGCTC
AAGGTGATCC AGGAGCAGCC GAGCCACCGC AACGTGAAGG AGCTGCGCCC GGCGCCGTGG
ACCGAGGGGC TGGAGCGCGA CCCCGGCGGA CCCGCCGCCG GGGTGGGCGT GCCCACCACC
CCGTTCCCGT CCTGGTTCCG GTGCACCTCC TGCGACGAGC TGGTGGCCCT GGACTCGGAC
ATGCTCGCCT TCGAGAACAC CAACCCGCGC AGGCCGCACG AGGCCCGCTT CGTGCACAAC
GTCGGCAAAC ACAAGAAGGG CAAGCCGCTG GCGGTCCCCG CCCGGTTCGT GCTGGCGTGC
ACCGACGGGC ACCTGGACGA GTTCCCCTAC GTCCACTTCG TGCACCGGGG GGAAGCGTGC
CCCAGGGCCG AGAAGCCGCA GCTGAAGATG GAGGACCGGG GCGGGAACGT CGGCGCGAAC
GTGGAGCTGA CGTGCGTGGT GTGCGGCGCG CACCGCAACA TGCGCGACGC GGGCGGTGCG
CGGGGTAAGG AGAACCTGCC GGCGTGCCGA GCACGCCACC CCCACCTGGG CGTGTTCGAC
CCGGAGGGAT GCAGTCAAAA TCCCAAGACC CTGGTGCTGG GCGCGTCCAA CCAGTGGTTC
TCGGAGCTGC TGTCGACGCT GGCGGTCCCC TCCGGCCAGG GCACGGGCGA ACTGGACTCC
CTGGTGGAGC AGTACTGGGA CATGCTGGAG GAGACGCCCC AGAGCCAGTA CAAGATCATG
CGGCAGTTCG CGCCGCCCAT GCGCGACCAC TTCGGCAAGT GGGACGACGA CACCGTGTAC
GAGGCGGTGG AGCGGCGCCG CGCGGTCCTG GAGGGGAAGG CCGGGGACGG CGGGAAGGAC
GCGCCCTCGG GCCGCCAGGC CCTGCGCACC GCCGAGTGGG AGGCGCTGTC CTCCCCCGAC
GCCCACGAGC CCCGGCCCGA CTTCGCGCTG CGCCGCCTGG AGGGCGGCGT GCCCGAGGAG
CTGCGGGGCG TGTTCGCCGA CGTGGTGCAG GCCGAACGGT TGCGCGAGGT GCGGGTGCTG
ACCGGGTTCA CCCGCCTGGA CTCCCCCGAC CTGGACGACC CGATGATGGT GCAGACGGTG
CGGCTGTCGC GCGACGAGGC GACCTGGCTC CCGGCCAGCG AGGTGCGCGG CGAGGGCGTC
TTCCTGCGGG TCCCGGAGGA GCTGCTGGCC GCCTGGGAGA AGCGGGTGGC CGACAGCGAG
GCCCTGGAGC TGCACCGGGA GGCCTACGGC ACCTTCCGGG AGAAGCGCTA CTCCGACCGG
GTCGGCTCGG GGTTCGAGCG GATGCGCAAC TGGCCGGGCG CGCGCTACGT CGCCCTGCAC
ACCCTGTCGC ACCTGCTGAT CCGGACCATC GCCCTGGACT GCGGGTACAA CGCGGCGAGC
CTGTCCGAGC GCGTCTACGC CGGGACCGAG GAGGACCCGC GCGGGGGCAT CCTCATCTAC
ACGGCGGTGC CCGACGCCGA GGGGACGCTG GGCGGTCTGG TCTCGCAGGC CGAGCCGGAG
CGGCTCGTAC ACCTGGTGCG CAAGGCCCTG CACGGGGCCA TGCACTGCTC CTCGGATCCG
CTGTGCGCCG AGCGCCTGCC GCAGGCGAAC GCGGACTTCC TGCACGGGGC GGCCTGCCAC
GTGTGCCTGT TCGTGTCCGA GACGACCTGC GAGCACGGCA ACCGGTTCCT GGACCGCAGG
TTCGTGGTGC CGATCGGGGA TCCGGAGCTG GCCCTCTACC CTGAGCTTCC GTGA
 
Protein sequence
MHKETHRRRV GSVRPSHLMF TSGVGSLVDL PNFAVLVRGL DEWNHTHAYG WEPIVEPRLL 
KVIQEQPSHR NVKELRPAPW TEGLERDPGG PAAGVGVPTT PFPSWFRCTS CDELVALDSD
MLAFENTNPR RPHEARFVHN VGKHKKGKPL AVPARFVLAC TDGHLDEFPY VHFVHRGEAC
PRAEKPQLKM EDRGGNVGAN VELTCVVCGA HRNMRDAGGA RGKENLPACR ARHPHLGVFD
PEGCSQNPKT LVLGASNQWF SELLSTLAVP SGQGTGELDS LVEQYWDMLE ETPQSQYKIM
RQFAPPMRDH FGKWDDDTVY EAVERRRAVL EGKAGDGGKD APSGRQALRT AEWEALSSPD
AHEPRPDFAL RRLEGGVPEE LRGVFADVVQ AERLREVRVL TGFTRLDSPD LDDPMMVQTV
RLSRDEATWL PASEVRGEGV FLRVPEELLA AWEKRVADSE ALELHREAYG TFREKRYSDR
VGSGFERMRN WPGARYVALH TLSHLLIRTI ALDCGYNAAS LSERVYAGTE EDPRGGILIY
TAVPDAEGTL GGLVSQAEPE RLVHLVRKAL HGAMHCSSDP LCAERLPQAN ADFLHGAACH
VCLFVSETTC EHGNRFLDRR FVVPIGDPEL ALYPELP
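
The gene and protein sequences above can be cross-checked against each other and against the GC content field. The sketch below is one possible way to do that in plain Python, not the method used to produce this record. The helper names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical. The codon table is the amino-acid assignment of NCBI translation table 11, which assigns the same amino acids as the standard code; the special handling of alternative start codons is omitted here because this gene begins with ATG.

```python
BASES = "TCAG"
# Amino acids for translation table 11, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 to 1.0)."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(text: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon.

    Only A, C, G and T are handled; any other symbol raises KeyError.
    """
    seq = clean_sequence(text)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First line of the gene sequence, copied from the record above.
first_line = "ATGCACAAGG AGACGCACCG GCGCAGGGTC GGCTCGGTCC GGCCCAGCCA CCTGATGTTC"
print(translate(first_line))  # MHKETHRRRVGSVRPSHLMF
```

The printed residues match the first twenty residues of the protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` would allow the same comparison against the GC content and Protein Length fields.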