Gene Ndas_3111 details

Gene Information

Locus tag: Ndas_3111
Symbol:
ID: 9246967
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 3725762
End bp: 3727093
Gene Length: 1332 bp
Protein Length: 443 aa
Translation table: 11
GC content: 75%
IMG OID:
Product: protein of unknown function DUF418
Protein accession: YP_003681026
Protein GI: 297562052
COG category:
COG ID:
TIGRFAM ID:
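
The coordinate and length fields above are consistent with one another, which can be checked with a little arithmetic. The following is a minimal sketch in Python, not part of the record itself: the variable names are illustrative, and it assumes that the coordinates are 1-based and inclusive and that the final codon is an untranslated stop codon.

```python
# Values copied from the Start bp and End bp fields of the record.
start_bp = 3725762
end_bp = 3727093

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and contributes no residue.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")     # compare with the Gene Length field
print(protein_length, "aa")  # compare with the Protein Length field
```

Under these assumptions the computed values match the listed Gene Length (1332 bp) and Protein Length (443 aa).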


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.414343
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTGGGC ACACCGTTTC CCGCGGTCCG GTCGGCGCGG GGGAGAGAGC CCTCGCACCG 
GACCTGGCGC GCGGGTTCAT GCTGCTGTTC ATCGCGCTCG CCAACACGGC CTGGTACCTG
TGGGCGCTCC CGTCCAGCGG GGGCGGCGTC CACCCCGAGC CGGGCGGCGT ACTGGACCGG
ATCGCGCAGT TCGTCATCGT CACGGCGGTG GACATGCGCA GCTATCCGAT GTTCGCCTTC
CTGTTCGGGT ACGGCATGGT GCAGCTGGCC CGGCGCCAGG AGGCGGCGGG CTCCTCCGCG
CGGGAGGTGA ACGCGCTGCT GCGCCGCCGC AACCTGTGGC TGCTCGCGTT CGGTTTCGTC
CACGCCCTGC TGCTGTGGAT GGGCGACGTC CTGGGCGCCT ACGGCCTGGC AGGGCTGCTC
CTGGGCTGGC TGTTCCTGCG ACGCAGGGAC GCCACGCTGC TGGTGTGGTC GGGCGTGTTC
ACCGGGCTGG CCGCCCTGCT GGCGGCGTTC AGCCTGCTGG GTCTGGCCTC CATGCCCGCG
GAGGCCTCCT ACTCCTCCTC GGCCGCGTTC AGCACGGACC TGATGGCCGA CAACATCGGC
GACACCTCGA TCCTGGGCGC CGCGCTGGCC CGCGTCCTGG ACTGGCCGCT GGTCACGCTG
GGCCAGGGGC TGCTGGGCAT GGTCGTGCCC GCGGCGATCC TGCTCGGCTA CTGGGCCGCG
CGCCGCCGGA TCCTGGAGGA GCCGGGCGGG CACCTGGGCC TGCTGCGCTG GACCGCGGCC
CTGGGCATCG GCGTGGGCTG GCTGGGCGGG CTGCCCCTGG CCCTGACCCA GATCGGGGTC
TGGGAGCTGT CCCCCGCGCA GGCGGCCATG CTCACCATGC CGCACATGGT GACCGGGCTG
GCCTGCGGCC TGGGCTACGT GGCGCTGATC GCGCTGGCGG CGCACCGGAT CCAGGGGCGC
GGTCGCGCAC CGGGCGCGGT GGTCGGCGCG CTGTCGGCGA CCGGCAAGCG CTCGCTGTCG
GCCTACCTGG CCCAGTCGGT GCTGTGCGCG CCCCTCCTGG CGGCCTGGGG CCTGGGACTG
GGCGGCGAAC TCGCCTCGTG GTCGATGGCC CTGTTCGCGG TGGGCGTGTG GCTGGTGACG
GTGGCGGCGT CCTACGCCCT GGAGCGCGCG GGCAGGCGCG GACCGGCCGA GGTGCTGCTG
CGCCGCCTGG CCTACCGCCG CCCGGTCGCG CGGGGCGCTC GGGAGTCGGA GGAGCCCGGC
AGCGCGTCGG TTCGCACGGT CGGGCGCGGA CGGGCACGGG GACCGGCGGA CGGAGAGCGT
CCCGCGCCCT GA
 
Protein sequence
MSGHTVSRGP VGAGERALAP DLARGFMLLF IALANTAWYL WALPSSGGGV HPEPGGVLDR 
IAQFVIVTAV DMRSYPMFAF LFGYGMVQLA RRQEAAGSSA REVNALLRRR NLWLLAFGFV
HALLLWMGDV LGAYGLAGLL LGWLFLRRRD ATLLVWSGVF TGLAALLAAF SLLGLASMPA
EASYSSSAAF STDLMADNIG DTSILGAALA RVLDWPLVTL GQGLLGMVVP AAILLGYWAA
RRRILEEPGG HLGLLRWTAA LGIGVGWLGG LPLALTQIGV WELSPAQAAM LTMPHMVTGL
ACGLGYVALI ALAAHRIQGR GRAPGAVVGA LSATGKRSLS AYLAQSVLCA PLLAAWGLGL
GGELASWSMA LFAVGVWLVT VAASYALERA GRRGPAEVLL RRLAYRRPVA RGARESEEPG
SASVRTVGRG RARGPADGER PAP
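
The relationship between the gene sequence and the protein sequence can be illustrated by translating the first 60 bases of the gene. This is a hedged sketch rather than the method used to produce the record: the names `CODON_TABLE` and `translate` are hypothetical helpers, and the codon table is built by hand from the standard amino-acid assignments, which translation table 11 shares with the standard code (the two differ only in which codons may act as start codons).

```python
# Amino-acid assignments in NCBI order: each codon position cycles through T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Hypothetical helper table: maps each of the 64 codons to a one-letter amino acid.
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence as listed in the record.
first_60 = "ATGTCTGGGC ACACCGTTTC CCGCGGTCCG GTCGGCGCGG GGGAGAGAGC CCTCGCACCG"
print(translate(first_60))  # MSGHTVSRGPVGAGERALAP
```

The output matches the first 20 residues of the listed protein sequence. The same function can be applied to the full gene sequence after joining its lines.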