Gene Ndas_3793 details


Gene Information

Locus tag: Ndas_3793
Symbol:
ID: 9247662
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 4554010
End bp: 4555041
Gene Length: 1032 bp
Protein Length: 343 aa
Translation table: 11
GC content: 76%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003681697
Protein GI: 297562723
COG category:
COG ID:
TIGRFAM ID:
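
The coordinates and lengths listed above are mutually consistent, which can be checked with a few lines of arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Coordinates copied from the record (assumed 1-based, inclusive).
start_bp = 4554010
end_bp = 4555041

# An inclusive interval spans end - start + 1 positions.
gene_length = end_bp - start_bp + 1

# Every three bases form one codon; the last codon is assumed to be the stop
# codon, which does not contribute an amino acid.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # 1032 343
```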


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.146653
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAAGAAGT GGATCATCAT CGGCGCGGTC CTGGTCCTCG TGGGAGGACT GACCGGCGCC 
GGCTACCTCC TGGTGTCCCG TCTGGGCGGC CCGGGCGCCG GAGAGGACGC CGGCATGCAG
GAGGCCGCCC TGGACGTGGG CGGGGCCCCG GTCGAGGTGG TGCTGGGAGA GGTCAGCTCC
GTGCTGGTGC TGGACGCCGT CGTCCGGGCC GAGCCCGGCG AGGCGGTCGA GGCCCGCAAC
GGGGGTACGG TCTCCCACCT GTGGGTGGGC GAGGGCGCCG AGGTCGACCG GGGCGCCCCG
GTGGTCAACG TGAGGGTGCC CGCCGAGGGC GCGCCCGTCA CCGGGGAGGA CGGCCAGGCC
CCCGACGCCC CCACCACCGA GGAGGTCACC CTGTACGCGC CCGCCGCCGG AACGGTGTCG
GGGCTGGAGG ACGTCATGGT CGGCGACGTG CTGGAGCCCG GCGCGGTGGT GGCCACGGTC
GCCCCCGAGG AGTTCCGGGC CGTCGCGTCG ATCCCGCCCA ACGACCTCTA CCGGTTCTAC
GAGGACCCCG AGGACATCCT GCTCCAGATC GACCAGGGGC CGCCCGCCGC CGCGTGCGAG
TTCCTGTCCC TGGGCACGGC CGAGGGCGGC GCCCCCGTCC CCGGGGACGG GGGCCGCGAG
GGCACCGAGG ACGCCGGCGG GGGCGGCGGC GGGGCCGAGC TGGCCTGCCG CGTGCCCGCC
GACCTGGAGG TCTTCGAGGG CGTCCAGGGA CAGCTGTCGG TCAGCACCGG CGCGGCCACG
AACGCGATCA TCGTCCCCGT GACCGCGGTG CGCGGCACCG CCGGGTCGGG CGAGGTCGTG
GTGGTGGGCG AGGACGGCAC CGAGGAGACC CGCGAGGTGG TGCTGGGCGT CTCCGACGGG
TCCTCCGTGG AGGTCACCGA GGGACTGAGC ATCGGCGAGA CGGTCCTGGA CCCGATCCCG
CTCGACCCCC GGTTCGACGT GCCCGGCGCG CACGACCCCG AGGACGCGTT CGTCGAGGAG
GGTGAGGGCT AG
 
Protein sequence
MKKWIIIGAV LVLVGGLTGA GYLLVSRLGG PGAGEDAGMQ EAALDVGGAP VEVVLGEVSS 
VLVLDAVVRA EPGEAVEARN GGTVSHLWVG EGAEVDRGAP VVNVRVPAEG APVTGEDGQA
PDAPTTEEVT LYAPAAGTVS GLEDVMVGDV LEPGAVVATV APEEFRAVAS IPPNDLYRFY
EDPEDILLQI DQGPPAAACE FLSLGTAEGG APVPGDGGRE GTEDAGGGGG GAELACRVPA
DLEVFEGVQG QLSVSTGAAT NAIIVPVTAV RGTAGSGEVV VVGEDGTEET REVVLGVSDG
SSVEVTEGLS IGETVLDPIP LDPRFDVPGA HDPEDAFVEE GEG
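
The sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that layout, compute a GC fraction, and translate a coding sequence with the amino acid assignments of NCBI translation table 11, treating the first codon as a start codon (so the leading GTG is read as M). The helper names `clean`, `gc_fraction` and `translate` are hypothetical, and only the first line of the gene sequence is used to keep the example short.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G). Table 11 uses the same amino
# acid assignments as the standard code and differs in its start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}
# Start codons listed for table 11; at the first position they encode M.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the record."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence, copied from the record.
first_line = "GTGAAGAAGT GGATCATCAT CGGCGCGGTC CTGGTCCTCG TGGGAGGACT GACCGGCGCC"
print(len(clean(first_line)))   # 60
print(translate(first_line))    # MKKWIIIGAVLVLVGGLTGA
```

Passing the full gene sequence to the same helpers would allow the GC content and protein sequence reported in the record to be checked in the same way.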