Gene Ndas_1780 details


Gene Information

Locus tag: Ndas_1780
Symbol:
ID: 9245630
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 2178625
End bp: 2180091
Gene Length: 1467 bp
Protein Length: 488 aa
Translation table: 11
GC content: 75%
IMG OID:
Product: PepSY-associated TM helix domain protein
Protein accession: YP_003679714
Protein GI: 297560740
COG category:
COG ID:
TIGRFAM ID:
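
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon (the gene sequence in this record ends in TGA).

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by a CDS of the given length."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(2178625, 2180091)
print(length)                     # 1467
print(protein_length_aa(length))  # 488
```

Both values agree with the Gene Length and Protein Length fields of the record.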


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.40116
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGACAGG AAGACAGCGC GCCCCCGCAG GACGGGACGC GCGCGGAGCC GGACCGGCGC 
GGCACCTGGG CGGCGCTGCG CCCCCTGGTG CTGCGCCTGC ACTTCTACGC GGGGGTCCTC
GTCGCCCCCT TCATCCTGGT CGCCGCCGTG TCGGGGCTGC TGTACGTGTG GACGCCCCAG
ATCGAGCAGG CGGTCTACGC CGAACAGCTG CGGGTGGAGC CCTCCGGCGA ACCGGTCCCG
CTCCACACAC AGGTGCGCGT CGCCCAGGAG GAGCTGCCGG GAGCCGAACT CGACGCGGTG
CGACCGGCCA CCGGCGCGGA GGACTCCACC CGGGTGCTGT TCGACGTGCC GGGGCTGGAG
GCCAGCCACA GGACGACGGT GTTCGTCGAC CCCTACGGCG GCGAGGTGCT CGGGGTGATG
GAGACCTACG GCACCAGCGG CGCCCTCCCG GCCCGCACGT GGGTCGACAC CCTGCACCGC
AGCCTGCACC TGGGCGACGT CGGACGCCTG TACAGCGAGC TCGCGGCGAG CTGGATGTGG
GTGGTCGCCC TGGGCGGTGT CGCCCTGTGG GTCGCGCGCA ACCGGCGTCC GCGCGGCGGT
CTCCGCCGCC TGCTGCTGCC CGGCGCGGGC ACCTCCGGGG GACGGTCCAG GTCGGTCTCG
CTGCACGGAG CCGCCGGTCT GTGGCTTCTG GTGGGCCTGC TGTTCCTGTC GGCGACGGGG
ATGACGTGGT CGCAGTACGC GGGGGCCAAC ATCAGCGACC TGCGCGAGCG GCTGGACTGG
AGCACCCCCG CGGTGTCCAC GGAGGCGCCG GTGGCCTCGC CCGGGGTGGA CGCGGGAGTG
GGCGCCGTCC TGGCCAGTGC GCGCGAGGCG GGCCTGGACG GCCCCGTCGA GGTGGCCCTG
CCCGAGGACC ACACCTCCCC CTACGTGGTG AGCCAGATCG ACCGCGGTTG GCCGACGCGG
GTGGACTCGG CCGCCGTCGC CCCGGATACG GCCGAGGTCA CCGACGTGGT CCGCTTCGCC
GACTACCCGG TCATGGCCAA GCTGAGCCGG TGGGGGATCG ACGCCCACAT GGGTGTGCTG
TTCGGCGTGC CGAACCAGTT GGTGCTGTCC GCCCTGGCCT CGGGGCTGAT CGCCGTCATC
GTGCTGGGCT ACCGCATGTG GTGGCAGCGG CGCCCCACAC GGTCCCGAGT CCTGGGCGTG
GGACGCCCCT ACCCGCGCGG CTCCCTCACG GCCCTGTCCC CGCTGTCCAG GGTCGCGGCG
GTGGCGGTCC TGGCCCTGGT CGGCTGGGCG GCGCCGCTGC TGGGCGCCTC GCTGCTGGTG
TTCCTGGCCG TGGACGCCGT CCTCGGGTGG CGGGCCCGGT CACGTGCCTC AGGGGGCTCC
GCGCCGGGCG GCCGGGCTCC GGGTGCCGAC GGGCAGGCCG GACCCGAGTC CGCGGGCGCG
CCCGCGTCCG GGGCGCGGAA CCTGTGA
 
Protein sequence
MGQEDSAPPQ DGTRAEPDRR GTWAALRPLV LRLHFYAGVL VAPFILVAAV SGLLYVWTPQ 
IEQAVYAEQL RVEPSGEPVP LHTQVRVAQE ELPGAELDAV RPATGAEDST RVLFDVPGLE
ASHRTTVFVD PYGGEVLGVM ETYGTSGALP ARTWVDTLHR SLHLGDVGRL YSELAASWMW
VVALGGVALW VARNRRPRGG LRRLLLPGAG TSGGRSRSVS LHGAAGLWLL VGLLFLSATG
MTWSQYAGAN ISDLRERLDW STPAVSTEAP VASPGVDAGV GAVLASAREA GLDGPVEVAL
PEDHTSPYVV SQIDRGWPTR VDSAAVAPDT AEVTDVVRFA DYPVMAKLSR WGIDAHMGVL
FGVPNQLVLS ALASGLIAVI VLGYRMWWQR RPTRSRVLGV GRPYPRGSLT ALSPLSRVAA
VAVLALVGWA APLLGASLLV FLAVDAVLGW RARSRASGGS APGGRAPGAD GQAGPESAGA
PASGARNL
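
The sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that layout, compute a GC fraction, and translate a coding sequence. It is a minimal illustration, not the method used to generate this record: the helper names are hypothetical, and it relies on translation table 11 sharing its codon-to-amino-acid assignments with the standard code (the two differ in their permitted start codons).

```python
from itertools import product

# Standard codon assignments, codons ordered with bases T, C, A, G.
# Assumption: used here for translation table 11, which has the same
# codon-to-amino-acid mapping as the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(text: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First display row of the gene sequence in this record.
first_row = clean(
    "ATGGGACAGG AAGACAGCGC GCCCCCGCAG GACGGGACGC GCGCGGAGCC GGACCGGCGC"
)
print(translate(first_row))  # MGQEDSAPPQDGTRAEPDRR
print(round(gc_fraction(first_row), 2))
```

The translated fragment matches the first twenty residues of the protein sequence listed above. Passing the full cleaned gene sequence to the same functions would give the whole protein and the overall GC fraction.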