Gene Ndas_3167 details

Gene Information

Locus tag: Ndas_3167
Symbol:
ID: 9247024
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 3791309
End bp: 3793111
Gene Length: 1803 bp
Protein Length: 600 aa
Translation table: 11
GC content: 78%
IMG OID:
Product: protein of unknown function DUF43
Protein accession: YP_003681081
Protein GI: 297562107
COG category:
COG ID:
TIGRFAM ID:
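
The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the unspliced CDS ends in a single stop codon; the function names are hypothetical and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(3791309, 3793111)
print(length)                  # 1803
print(protein_length(length))  # 600
```

Both values agree with the Gene Length and Protein Length fields of this record.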


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCACGCGA CTCCTCCGGG CCCGGACGGC CCCCCTTCCG CTGCCTCCGG GCGGGTACCG 
GTCGCGGACG GCGACTCCGC CCGCGCCGAC CGCCCCCTTG ACGGCCCGCC GCCCCTTCCC
GAGCCCGCCG CGGAGCTGCT GCGCGGCCAC GGCGTGGACG CGCCCCGGCT GCGCCGTGTG
CTGGCGCTGC TCTCGGACGG GCGCGAGTGG GAGGCGAACG CGCTGGTGCG CGCCAGCGGC
GTCGCGTACG CGCTGGTGTC CTCCCTGGTG GGGGCCCTGA CGGAGGCCGG TGAGCTGGCG
CCCGGGGACG GGCGGGGCCG GGTGCGCCTG GTGCGGCCCG AGCGCTACGC GGGCGCGGCC
GGGGAGGAGC CCGCCGACCC GGTGGCCCAC CTGCTCCCCC GCTACTCCCG GGCGCTTGCG
GAGCTGGCGC GGGCCGTGGA GGAGGGGCCC GCCTCGGATC TGGACCTGGA CCACGTGGCC
GCGACGGCCG AGACCGCGCT GCGCCGGGCG CTGCTGCTGT CCACCCGGTT CGACCTGCGC
GACCGCACGC TCCTGTGCGT GGGCGACCAC GACCTGACCT CGGTGGCGCT GACCCTGGTG
TGCCCTGGCG CCCGGGCGCA GGTCGTGGAC ATCGACGAAC GCGTCCTGGC CCACATCGAC
TCCCTGGCGG CCGAGCTCGG GCTGGCGGTG CGTACGCACG CCGCGGACCT GCGTCTGGGG
CTGCCGCCCG CGGTGCGCGG CGGCGCGGAC GTGGTGTTCA CCGATCCCCC CTACACGCCC
GACGGGGTGG AGCTGTTCGT GCGGCGCGGT GTGGAGGGGA TGGCCGATCC CCGCCGGGGC
CGGGTGCTGG TGGCCTACGG GGCCAGTGAG ACCACCCCGC GGCTGGCCGC GGCCACGCAG
GCGAGGCTGG TGCGGATGGA CCTGCTGTTG GAGGCGGTGT GGCCGGACTT CAACCGCTAC
CACGGGGCCG AGTCGATCGG CGCGGCCTCC GACCTGTACG TGCTGCGGCC TCTGGCGCGT
ACGCTGCCCG CTCCCTCGGG TGAGGTGGCG CGCGTGTACA GCCAGGGGGT GAACGCCAAG
GAGGCCCGGG GCGGTCTGGA CGCCGACCGG GCCCGCGCCG TGCTGGACCG GGTGGCCGGG
GAGACCGCGA GTGCCGCGGG TCCCGGCGGG ACCGGGGAGA CCGTTGGGGC CGGGGAGGCC
GGTGGTGATC GCGCGCCGAC GCTGGTGGGC GCGTGGCCGG GCGAGGTCGC CGGGTCGGGG
CGGGTGCGCC TGTCCACGTG GTTGGAGGCG CCCGGGCAGG GCGGCGGACG CGCGGTGGTC
AATCTCACGG GCGGGTGGGA CCGCTGGGCG GCGCGGGCGG CCCTGGCCGC CGCCGGGGAC
ACCGTGTACG TGCTGGTGCC GTCCTCGGCG GCGTGCGTGC GCGACGAGGC GGGCCAGCGG
GGGCTGCGCG CCCTGGTGGA ACCGCGCTTC GGGGTGCGGT TCCTGCGCGG GTTCGGCGCG
GACGGCCTGA CGGCGGTGCG GCTGACCCGG CGCCCGGACG CGGACTCGGC GGTGGACCGG
CTGCTGGTGT ACGTGCAGGA GAGGGCGCAC GGCACGCTCA CCGCGACCCT GCGGGCGGGG
CTGGTGGAGG TGTCGGCGTG GCGGGAGAGC CCGGTCAACA AGCGCACGGC GCGCCACGCG
GTGGCCGCGG CGCCGGAGTG GGTGTCCGGG CACACCCTGC TCGATCTGCC CGAGCACCGC
TTCGGTGAGC TGCGCGGGGT GGCCGCCGAC CTCCTGGAGC GGGTGGGGTC CTCCCCGGCC
TGA
 
Protein sequence
MHATPPGPDG PPSAASGRVP VADGDSARAD RPLDGPPPLP EPAAELLRGH GVDAPRLRRV 
LALLSDGREW EANALVRASG VAYALVSSLV GALTEAGELA PGDGRGRVRL VRPERYAGAA
GEEPADPVAH LLPRYSRALA ELARAVEEGP ASDLDLDHVA ATAETALRRA LLLSTRFDLR
DRTLLCVGDH DLTSVALTLV CPGARAQVVD IDERVLAHID SLAAELGLAV RTHAADLRLG
LPPAVRGGAD VVFTDPPYTP DGVELFVRRG VEGMADPRRG RVLVAYGASE TTPRLAAATQ
ARLVRMDLLL EAVWPDFNRY HGAESIGAAS DLYVLRPLAR TLPAPSGEVA RVYSQGVNAK
EARGGLDADR ARAVLDRVAG ETASAAGPGG TGETVGAGEA GGDRAPTLVG AWPGEVAGSG
RVRLSTWLEA PGQGGGRAVV NLTGGWDRWA ARAALAAAGD TVYVLVPSSA ACVRDEAGQR
GLRALVEPRF GVRFLRGFGA DGLTAVRLTR RPDADSAVDR LLVYVQERAH GTLTATLRAG
LVEVSAWRES PVNKRTARHA VAAAPEWVSG HTLLDLPEHR FGELRGVAAD LLERVGSSPA
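
The sequence blocks above are printed in groups of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of how the blocks might be cleaned, how a GC fraction could be computed, and how the nucleotide sequence could be translated for comparison with the protein sequence. It assumes translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the two differ in permitted start codons); all function names are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments,
# assumed to apply to translation table 11 as well).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in a cleaned DNA sequence."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
fragment = clean_sequence("ATGCACGCGA CTCCTCCGGG CCCGGACGGC")
print(translate(fragment))  # MHATPPGPDG
```

The translated fragment matches the first ten residues of the protein sequence. Passing the full cleaned gene sequence to `gc_content` and `translate` would allow the GC content and Protein Length fields to be checked in the same way.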