Gene Ndas_0503 details


Gene Information

Locus tag: Ndas_0503
Symbol:
ID: 9244344
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 620326
End bp: 621729
Gene Length: 1404 bp
Protein Length: 467 aa
Translation table: 11
GC content: 71%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003678456
Protein GI: 297559482
COG category:
COG ID:
TIGRFAM ID:
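
The length fields in this record are consistent with each other if the coordinates are read as 1-based and inclusive, and if the final codon is a stop codon that is not counted in the protein. The sketch below shows that arithmetic; the function names are hypothetical and the coordinate convention is an assumption, not something the record states.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count, assuming the last codon is a stop and is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a whole number of codons")
    return gene_len_bp // 3 - 1


length = gene_length_bp(620326, 621729)
print(length)                     # 1404
print(protein_length_aa(length))  # 467
```

Both values match the Gene Length and Protein Length fields of the record.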


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTGCATTA TTGATAACGC AAAGCTTACG CGCCGGTCGG TCCCGAGGAG GACCGGCCGG 
ACCTTCCTGG TCACAGCCGC CACCCTGGCC CTGGCGGCGG GCGCCCTGTC GACCGCCCCG
GGGTTCCGGG GCTGGTGGGC CCCGGCCCCC GCCTCCGCTG CCGTCACGGT CAACCCCGTC
GCCCCCGCAC TCGGGTTCAA CGCCATCTCC GAGGGCGACA CCTACCTGGC CGGGACCGAG
TCCGAGGGTC CGATCGCGGT CGGCGGCGAC CTGTCCTTCG GCCGCGACTA CCGGGTGCGG
ATCCACAACG ACGGCACCTT CACCGACACG GGGGAGAGCA ACCCCGTGGC GCTGCTGGTC
CAGGGCGAGG TGGACTTCGA GGACAGCGAT CCGCAGGGTG TGCTCAGCAT CCCCACGGGC
GCCTACGTCA AGATCGGCGA CCTCGACGAC GCCGACGTCT ACACCACCGA CCTCAACAAC
GCCTCGGCCA ACACCCGGAT CGTCCCCCAG GGCGCCGGTT ACGAGGGCTA TCCGCGGGTC
CAGCTCAGCA CCATGCAGCC CGCCGACTCC GTGGGACCGG CGCCGGACCT GATCGACTTC
GAGGAGGCCT TCTCCGCCTT CCGCTCCTAC TCGTCCGGGT TCGCGCTCTG CTCCACCACG
GCCACGCTGA CCACCCCCAA CGGCGACCCG ATCGACCCGG ACGCCATCCC CGAGGGCGCG
GACCTGCACG TCAGCCTCAC CCCCGGAGAG CAGAACGTCC TCAACCTCAC CGCCGAGGAA
CTGAACTCGC TCTCCCAGCT GACCTTCGAC GACCCGCCGA GCCAGGACAC CCCGCTGATC
ATCAACGTCG ACACCTCGGG CGTCGGCGGC GACTTCACGT GGGAGGGCCC CACCGGCGCC
GGGTTGAGCT CCTCCACGGC CCCCTACACC CTGTGGAACT TCTCCGACGC CCGGACGGTC
ACCATCCCCG TGGGCACCGA CAGCATCGAG GGCACCGTCT ACGCCCCGAA CGCCGCGGTC
AACCACTACT CCTCGGCCAA CATCGAGGGA ACGGTGGTCG CCCGCGAGTT CACCCAGGAC
GTGGGCGGCC CGCGTGAGAG CCAGCGTGAC ACACGCGTCG TCCGGCCCTT CGAACTGCAC
TACCACCCGT TCGACGCCGA GCTGACCACG TGTGACGCCG GTCCGTCGCC GGACCCGGAC
GAGCCCACGG ACCCGGATGA GCCCACGGAC CCGGATGAGC CCACGGATCC GGACGAGCCC
ACGGACCCGG ACGAGCCCAC CGACCCCGAC GAGACGGACG ACCCCGACGA GACGGACGAC
CCCGACGAGC CCGGGGACCC GGACGAGCCC GGCCGTCCGG ACGAGACGGA CGACCCCGAC
GACCCGGCCC CGGGCTACCG GTAG
 
Protein sequence
MCIIDNAKLT RRSVPRRTGR TFLVTAATLA LAAGALSTAP GFRGWWAPAP ASAAVTVNPV 
APALGFNAIS EGDTYLAGTE SEGPIAVGGD LSFGRDYRVR IHNDGTFTDT GESNPVALLV
QGEVDFEDSD PQGVLSIPTG AYVKIGDLDD ADVYTTDLNN ASANTRIVPQ GAGYEGYPRV
QLSTMQPADS VGPAPDLIDF EEAFSAFRSY SSGFALCSTT ATLTTPNGDP IDPDAIPEGA
DLHVSLTPGE QNVLNLTAEE LNSLSQLTFD DPPSQDTPLI INVDTSGVGG DFTWEGPTGA
GLSSSTAPYT LWNFSDARTV TIPVGTDSIE GTVYAPNAAV NHYSSANIEG TVVAREFTQD
VGGPRESQRD TRVVRPFELH YHPFDAELTT CDAGPSPDPD EPTDPDEPTD PDEPTDPDEP
TDPDEPTDPD ETDDPDETDD PDEPGDPDEP GRPDETDDPD DPAPGYR
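
The gene and protein sequences above can be cross-checked against each other. The following is a minimal sketch, assuming the amino acid assignments of the standard genetic code (which translation table 11 shares) and a simplified set of start codons (ATG, GTG, TTG) that are read as methionine when they open the reading frame. NCBI's translation table 11 lists further alternative start codons that this sketch leaves out. The helper names `clean`, `translate` and `gc_fraction` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons, in TCAG order (third base varies fastest).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Simplified assumption: only the most common bacterial start codons.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiating GTG or TTG is read as methionine
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence, as printed in the record.
print(translate("GTGTGCATTA TTGATAACGC AAAGCTTACG"))  # MCIIDNAKLT
```

Passing the full gene sequence to `translate` and `gc_fraction` allows a comparison with the listed protein sequence and the reported GC content.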