Gene Ndas_2561 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_2561 
Symbol 
ID9246412 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp3051833 
End bp3053449 
Gene Length1617 bp 
Protein Length538 aa 
Translation table11 
GC content74% 
IMG OID 
Productprotein of unknown function DUF187 
Protein accessionYP_003680486 
Protein GI297561512 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.443654 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.134171 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATTCAG TGGCCCGTGC CGTGACCCCG ACGCGCGCCG CGCTGCGGTG GGCCGCGACG 
GCCGCCGCGG CCACCGTCCT GCTGGCGGGG TGCACCGGCC GGGGCGCCGA CGAGGAGCCA
CCGGCCGACG GCGCGTCCGG GGCGGCCGCC ACCCTGCGGT GCGGGGACGG CGGGGCGCAG
CGGCAGATGC GCGGGGCCTG GCTGACCACG GTCCGCAACA TCGACTGGCC CTCCGAGCCG
GGCCTGTCCG CCGAGGAGCA GAAGGCCGAA CTCGACGCCT ACCTCGACGA CGCCTCCGCC
ATGGGGCTCA ACGCGGTCTT CCTGCACGTG CGGCCCACCG CCGACGCCGT CTACGAGTCG
GACCTGGAGC CGTGGGCGCG CTACCTCACC GGCGAGCAGG GCGGCGACCC CGGCTACGAC
CCGCTGGAGT ACGCGGTGGC CGGGGCGCAC GAGCGCGGCC TGGAGCTGCA CGCCTGGTTC
AACCCCTACC GCGTGGGCTT CCAGGACCCC GACGTCGAGA ACCTGGCCGA GGACCACCCG
GCGAGGGAGA ACCCCGAGTG GCTCGTCGAC TACGGCGTCG AGGCCTACTT CGACCCCGGC
AACCCCGGGG TCCGCGAGTG GGTGACCCGC GTGATCCTCG ACGTGGTGGA GCGCTACGAC
GTCGACGGCG TGCACTTCGA CGACTTCTTC TACCCCTACC CCAAGGAGGG CGAGGAGTTC
GACGACGACG CCTCCTGGGA GGAGCACGGC GACGGCTTCG AGGCCCGCGA GGACTGGCGG
CGCGACAACG TCAACCAGCT CATCGCCGGG GTCCACGAGG GCATCGAGGC CACCAAGCCC
TGGGTGAGCT TCGGGATCTC CCCGTTCGGC ATCTGGCGCA ACGACTCCAC CGACCCCAGC
GGCTCGTCCA GCTCGGGCCT GCAGTCCTAC GACGCCCAGC ACGCCGACAC CCGCACCTGG
ATCCAGGAGG GCACGGTCGA CTACGTCGCG CCGCAGCTGT ACTGGGAGCG GGGCTTCGAG
ACCGCCGACT ACGAGGTCCT GGCCGACTGG TGGGCCCGGG AGGTCGAGGG CACCGGCGTG
GACCTGTACA TCGGCCAGGC CGCCTACCGG GTGGGCGAGG ACGGCTGGAC CGGCGAGGAC
GCGCTCAGCA CCCAGCTGGA CTACTCCGGC GAGCTGCCCC AGGTGGACGG CGACGTCTAC
TTCTCCATCA AGAACCTGCG CGAGCAGGCC GCCGACGCCT ACGCCGCGCT GGCCGACGAG
CACTACGGCG CCCCCGCCCT GCCGCCGCTC TCCGACGCGC CCGGGGGCCG GGGGCCGCTG
GTCGGCGCGG TCGGCGGCGT CACGGCCCGG GCCGCCGACG AGGGTGTCGA CGTGGCGTGG
GAGGCCGTGG ACGGCGCCCG CTTCTACGCC GTCTACCGGC TGCCCGCCGA CGCGGCCGCC
GGGTCCGAGG AGGAGCGCTG CGCCGCGGTC ACCGCGGACA ACCTCGTGGG CCTCACCGGG
CGGACCGCGC TGACCGACAC CGCGCCGCTG GAGGACGGCG CCGGGTACGT GGTCACCGCG
CTGGACGACT ACCGCGCCCA GGGACCGGTC AGCGAGGTCG CCGACGTGCG CGGCTGA
 
Protein sequence
MDSVARAVTP TRAALRWAAT AAAATVLLAG CTGRGADEEP PADGASGAAA TLRCGDGGAQ 
RQMRGAWLTT VRNIDWPSEP GLSAEEQKAE LDAYLDDASA MGLNAVFLHV RPTADAVYES
DLEPWARYLT GEQGGDPGYD PLEYAVAGAH ERGLELHAWF NPYRVGFQDP DVENLAEDHP
ARENPEWLVD YGVEAYFDPG NPGVREWVTR VILDVVERYD VDGVHFDDFF YPYPKEGEEF
DDDASWEEHG DGFEAREDWR RDNVNQLIAG VHEGIEATKP WVSFGISPFG IWRNDSTDPS
GSSSSGLQSY DAQHADTRTW IQEGTVDYVA PQLYWERGFE TADYEVLADW WAREVEGTGV
DLYIGQAAYR VGEDGWTGED ALSTQLDYSG ELPQVDGDVY FSIKNLREQA ADAYAALADE
HYGAPALPPL SDAPGGRGPL VGAVGGVTAR AADEGVDVAW EAVDGARFYA VYRLPADAAA
GSEEERCAAV TADNLVGLTG RTALTDTAPL EDGAGYVVTA LDDYRAQGPV SEVADVRG