Gene Ndas_5098 details


Gene Information

Locus tag: Ndas_5098
Symbol:
ID: 9248988
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014211
Strand:
Start bp: 240877
End bp: 242271
Gene Length: 1395 bp
Protein Length: 464 aa
Translation table: 11
GC content: 77%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003682985
Protein GI: 297564012
COG category:
COG ID:
TIGRFAM ID:
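
The coordinate and length fields above agree with each other, and a short check can confirm this. The sketch below assumes that the coordinates are 1-based and inclusive and that the CDS ends with an untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 240877
end_bp = 242271

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1395 464
```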


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0121004
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.708615
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAGACGG ACACCTACAG CGAGGACGCC GCGCTCATCG GCGAACGCCA GGCCGCGGCC 
CGCGCCCTCC TGGCCCATCC CCTGCTCACC GAGCGCACCC ACCCCGCCGA GTTCGCCCTC
GTGCGCTCCC ACACCGAGTG GCTCGTCCAG CGCTTCCAAC GCGTGCTCGG CTACCGCCTG
ACCGTCGCCG AGGACCACGC CCGGCTGGTC AAACGCGGTC TCGTCACCGA GGTCGCGCGC
CCCCTCGCGC GCGGGACCGG CGCCCCCTTC ACCCCGCGCA CGCACACCTA CCTGGCGCTG
TCCCTGGCCG TCCTGGTCGA GGAGCAAGGC CCCACCACCG TCCGCGGCCT GGCCGCGCGG
GTCCGCTCCG CCGCGCACGA GGCCGGGGTC GACGCCGACC CCGAGCGCGG CCTCGCCGAG
CGCCGCGCCT TCTGCGCCGC CCTGCTCCAC CTGGTCTCGC TCGGCGCCCT CACCGAGGAC
TTCGGCACCA TCGCCGACCA CCGTGAGGAC CCCTCCGCCG ACGCGGAGCT GATCCCGCAC
ACGCAGGTCC TGCGCTCGGT CGCCGTCCAC CTGCCCCGCG CCTCCGACGA CCCCGACTCC
TTCCTCGCCG CCGCACGCGA CACCGACCCC GACGGCGACC ACCAGGGCGA GACCGCCCTG
CGCAGGCTCC TCGCAGAGAC CGCCGTCGTC TACCGCGAGG AGCTCCCCGA CCGCCAGCGC
GACCGCCTCG CCGCGCACCA GTGGCGGGCC GCAGCGGCGC TCGGCAACCT CCTGGGCTGC
GACACCGAGG TCCGCGCCGA GGGCGTCGCC CTCGTGATGC CCGACGAGGC GGGCGCCCGC
CCCGCCTTCC CCTCCGACGA CCCCGTCGGA CAGGTCGCCC TGGCCCTGGT CCGCCACCTC
TCCGGCCGCC TGCACCCCGG CCGCCCCGCC ACCTCCGCCC CGGTCCCGGA GGAGGAGATG
AACACCGCGC TGGAGGCCCT CTGCGGCGCG GACGCGCCCG CGCGGGCCGA GTGGGCCCGC
ACCGCGGGAC CCGAGATCCC CGACCCCGGA CGGCTCCGCG AGCGGGTCCT GGTCCTGCTG
GCCGACCTCG GACTGCTGCG GGGCGCCCCC GGACGGTGGC GGCTCACCGC CGCGGCCGCC
CGCTACGGGG CCGAAACGGA CATCCGTGTC CCCCCGATCC AGGACAATGA CGAAGACAGC
CGGACACATC CCGACCCGGT GCGTGCCCCC GGCGACGACG AGGGCGAACC CGACGCTCCC
GCCGACCCCC GCGGCGACCT GAGCCGGGTG GCGTCCGTGC TGCAAGCGGT GGCGAACGAG
AAGGTGAGCG CGACCAGTGG TGACTCAGGA GACCAGCCAG GCCCCGGAAG CGGCGGCGGC
GACGGGTCCG GCTGA
 
Protein sequence
METDTYSEDA ALIGERQAAA RALLAHPLLT ERTHPAEFAL VRSHTEWLVQ RFQRVLGYRL 
TVAEDHARLV KRGLVTEVAR PLARGTGAPF TPRTHTYLAL SLAVLVEEQG PTTVRGLAAR
VRSAAHEAGV DADPERGLAE RRAFCAALLH LVSLGALTED FGTIADHRED PSADAELIPH
TQVLRSVAVH LPRASDDPDS FLAAARDTDP DGDHQGETAL RRLLAETAVV YREELPDRQR
DRLAAHQWRA AAALGNLLGC DTEVRAEGVA LVMPDEAGAR PAFPSDDPVG QVALALVRHL
SGRLHPGRPA TSAPVPEEEM NTALEALCGA DAPARAEWAR TAGPEIPDPG RLRERVLVLL
ADLGLLRGAP GRWRLTAAAA RYGAETDIRV PPIQDNDEDS RTHPDPVRAP GDDEGEPDAP
ADPRGDLSRV ASVLQAVANE KVSATSGDSG DQPGPGSGGG DGSG
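
The sequences above are printed in space-separated blocks of ten, so they need to be joined before any analysis. The following is a minimal sketch, not the method used to produce this record. It joins the blocks, computes a GC fraction, and translates a CDS codon by codon. The helper names (`join_blocks`, `gc_fraction`, `translate`) are hypothetical. The codon table is written out by hand on the assumption that translation table 11 assigns the same amino acids as the standard code and differs only in which start codons it permits.

```python
from itertools import product

# Codons in TCAG order, paired with amino acids in the same order.
# Assumption: table 11 uses the standard amino acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def join_blocks(text):
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Return the fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate in frame from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence above (60 nt, i.e. 20 codons).
first_row = "ATGGAGACGG ACACCTACAG CGAGGACGCC GCGCTCATCG GCGAACGCCA GGCCGCGGCC"
dna = join_blocks(first_row)
print(translate(dna))  # METDTYSEDAALIGERQAAA
```

The printed peptide matches the first two blocks of the protein sequence above. Passing the full joined gene sequence to `gc_fraction` and `translate` would allow the GC content and protein length fields to be checked in the same way.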