Gene Ndas_1723 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_1723 
Symbol 
ID9245573 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp2093532 
End bp2095142 
Gene Length1611 bp 
Protein Length536 aa 
Translation table11 
GC content73% 
IMG OID 
ProductTAP domain protein 
Protein accessionYP_003679657 
Protein GI297560683 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00370292 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGACGAA GGCGGAGCGC GGGCCGGAGC CTGCTCCCGA TGGTGGCGGT ATGTCTGGCC 
GGTTCGCTGG TCGCCCCGGT GCCGGCGGCG GCCGACGAAC GGGGCGGGGC GCCGACGACG
GCACTGGAGT GGGGTGCCTG CCCCGAGGAC GTCCCGACCG GGACGTACCC GTTGGAGTGC
GCCACCGTGC CGGTGCCCGT GGACTACGAC GACCCCGACG GCGACCGGAT CGAGATCATG
GTGTCGCGGC TGGCCAGCAC GGAGCCGGAC CAGAGGCGCG GCGTGCTGCT GCTCAACCCG
GGCGGCCCGG GCGGGTCCGG GCTGTCGATG CCCGCGGACC TGGCCTCCCT GGGCGTGCCC
GCCAGCGTCC TGAACAGCTA CGACCTGATC GGCATGGACA CCCGCGGGGT CGGGCACTCC
TCCGCGGTGA GCTGCGGCTT CACCACCGAC CAGGAGTACT TCGGCAACAT CCCGCCCTAC
GCGGTCGACG AGACGGCGGT CACCGAGCAG GCCGCGGTCG CCGAGGGGGT CGCCGACCAG
TGCGCGCGCA GCGACGGGAA CGGGCTGATG CGCCACATCT CGACGGCCAA CATGGCCCGC
GACCTGGACC GGATCCGGGC CGCGCTGGGC GAGGAGGAGG CCAGCTTCTT CGGGCTGTCC
TACGGGTCGG CGCTGGGCGC GGCCTACGCC TCGATGTTCC CCGGGAGCAC CGACCGGATC
GTGCTCGACA GCAACATCGG CGACACCTTC CTCGACTACG ACGGCATGCG CAGGTTCGCC
CTGGGCTTCG AGGAGACCTT CCCCGACTTC GCCGCGTGGG CGGCGGAGCG CGACGGCAGC
TACGGCCTGG GCGGCTCTCC CGAACAGGTG CGGGAGACCT ACTTCGAGAC CGCCGAGCGG
CTGGACGAGG AGGGCCCGGT GGCCGGTGTC GACGGCCACG TCTTCCGGCT GGCCGTCTTC
GTGGGGCTGT ACAACGAGCG GAGCTACGGG CGGACGGCGC AGCTGTGGCA GTCGGTGCGC
CTCGCGGACG AGGAGGCGGT GCGGCGGCAC ATGGAGGAGA GCGGGCTCGG CGGCGCCGGG
GCGGCGACGG GCGAGCTGTG GCCCTCCGAC AACGCCTGGT CGGTGTTCCT CGCGACGACC
TGCAACGACG TCGAGTGGCC CGAGGACCTG GCCGCGTACC AGCAGGGCGT GGCCGAGGAC
CGGGAGAGGT ACCCGATGTA CGGCGCGGCC AGCGCCAACG TCATGCCCTG CGCCTACTGG
AACGACGCCC CGTCCGAACC CGCGGTGGAG ATCACCGGGG AGGGTCCGGC CAACGTCCTC
ATCATGCAGA ACCTGCGCGA CCCGGCCACG CCGCACCTGG GCGGAGTGCT GCTGGACGAG
AAGTTCGGTC AGCGTTCCCG GCTGGTGAGC GTCGACGACA GCGGTCACGG CGCCTACGTC
TACGGCGACA ACCCCTGCGC CTGGGACGTG GCCACGAGCT ACCTGGTCGA CGGCGAGTTC
CCGGAGCGGG ACGTCTCCTG CGGGGCGTCC GGGGCCTCCG GCCTGGGACT GGACGAGGAG
GTCCAGCGCT CGCGGACCCA AACCCTCGAC CGGCTCCAGG ACACCGCCTG A
 
Protein sequence
MGRRRSAGRS LLPMVAVCLA GSLVAPVPAA ADERGGAPTT ALEWGACPED VPTGTYPLEC 
ATVPVPVDYD DPDGDRIEIM VSRLASTEPD QRRGVLLLNP GGPGGSGLSM PADLASLGVP
ASVLNSYDLI GMDTRGVGHS SAVSCGFTTD QEYFGNIPPY AVDETAVTEQ AAVAEGVADQ
CARSDGNGLM RHISTANMAR DLDRIRAALG EEEASFFGLS YGSALGAAYA SMFPGSTDRI
VLDSNIGDTF LDYDGMRRFA LGFEETFPDF AAWAAERDGS YGLGGSPEQV RETYFETAER
LDEEGPVAGV DGHVFRLAVF VGLYNERSYG RTAQLWQSVR LADEEAVRRH MEESGLGGAG
AATGELWPSD NAWSVFLATT CNDVEWPEDL AAYQQGVAED RERYPMYGAA SANVMPCAYW
NDAPSEPAVE ITGEGPANVL IMQNLRDPAT PHLGGVLLDE KFGQRSRLVS VDDSGHGAYV
YGDNPCAWDV ATSYLVDGEF PERDVSCGAS GASGLGLDEE VQRSRTQTLD RLQDTA