Gene Ndas_1500 details

Gene Information

Locus tag: Ndas_1500
Symbol:
ID: 9245350
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 1839794
End bp: 1840909
Gene Length: 1116 bp
Protein Length: 371 aa
Translation table: 11
GC content: 77%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003679436
Protein GI: 297560462
COG category:
COG ID:
TIGRFAM ID:
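
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon (the gene sequence in this record ends in TAG). The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(1839794, 1840909)
print(length)                     # 1116
print(protein_length_aa(length))  # 371
```

Both results agree with the Gene Length and Protein Length fields of this record.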


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.472105
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000928379
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGATCCGGT GGCGCCGTGG TGAAGTCCGT CAGATCCTCC CCTCCTGGCC GGGGGTCGTG 
CTGTGCGCGG TGGCGCTGGA GCCCGTGGCC GAAGCGGACC GCAGAGGCCG CGGGGCCGCG
GGAGCGGGCG GAGAGGTGAC GTGCCGCGCG CTGGCCTACA CCGACCTGGT GGGCGCGCCC
GAGGCGGGCG ACACCGTGCT GCTCAACACC GGCGCCCTGG ACCTGGGGCT GGGGACGGGC
GGGTACGCGC TGGTGGTGGC CGTGCCCGAC CGGCTGCCCC CTGACCCCCG CGGGCCCGGC
CACCTGGTCA AGGCCCGTTA CACGCCGTTG CAGGCCACCG TGCTGGGCGC CGACGAGCAG
GGCTCGCCCC ACCACGAGGT GCTGCGCTCG GCCGAGGGTG TGGAGGGCAT GCCGGTGGTG
GTGGCCGACC TGCACTCGGC GCTGCCCGCC GTGGTGGCGG GGGTGCGCGC GCACCGGCCG
GGGGCGCGGA TCGTCCACGT GATGCTGGAC GGGGGCGCGC TGCCCGCGGC GTTCTCGCGG
CTGGTGGGCG CGCTGCGCGA GGAGGGTCTG CTGGCGGGGT GCGTGACCAC GGGGCAGTCC
TTCGGCGGTG ACCTGGAGGC GGTGACCGTG CATTCGGGGC TGCTGGCTGC CCGGCACGTG
CTGGGTGCGG ACGTGGCGGT GGTGTGCCAG GGTCCGGGCA ACCTGGGCAC CGGGACGCCG
TGGGGGTTCT CGGGAGTCTC GTGCGGTGAG GCGGTGAACG CGGCGGCGGT GCTGGGCGGG
CGTCCGGTGG CGTCGCTGCG GGTGAGCGAG GCCGACGCGC GCGAGCGGCA CCGGGGGGTG
TCGCACCACA GCCTGACCGC CTACGGGCGG GTGGCGCTGG CGCGGGCGGA GGTGGTGGTG
CCGCTGCTGC CGGGGGTGTT CGGCGAGCGC GTGCGCGTCC AGGCCGGGGC TCTGGGCGAG
CGGCACACCC TGGTGGAGGT GGGTGTGGAC GGGCTGGAGG AGGCGGTGCG GACGCTTCCG
GTGAAGGTGT CGACGATGGG CCGTGGCCTG GAGGAGGACC GCGCGGCCTT CCTGAGCGCC
GCGGCGGCGG GCCGCCACGC GGCCGCGCTG CTGTAG
 
Protein sequence
MIRWRRGEVR QILPSWPGVV LCAVALEPVA EADRRGRGAA GAGGEVTCRA LAYTDLVGAP 
EAGDTVLLNT GALDLGLGTG GYALVVAVPD RLPPDPRGPG HLVKARYTPL QATVLGADEQ
GSPHHEVLRS AEGVEGMPVV VADLHSALPA VVAGVRAHRP GARIVHVMLD GGALPAAFSR
LVGALREEGL LAGCVTTGQS FGGDLEAVTV HSGLLAARHV LGADVAVVCQ GPGNLGTGTP
WGFSGVSCGE AVNAAAVLGG RPVASLRVSE ADARERHRGV SHHSLTAYGR VALARAEVVV
PLLPGVFGER VRVQAGALGE RHTLVEVGVD GLEEAVRTLP VKVSTMGRGL EEDRAAFLSA
AAAGRHAAAL L