Gene Ndas_4874 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_4874 
Symbol 
ID9248761 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014211 
Strand
Start bp5727 
End bp7097 
Gene Length1371 bp 
Protein Length456 aa 
Translation table11 
GC content75% 
IMG OID 
Productprotein of unknown function DUF418 
Protein accessionYP_003682763 
Protein GI297563790 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.93625 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGAGG CGGGGTACGA CACGACAGGG GCACAACGGC ACGGGGACGG GACGCACCAC 
CCGGTCGGTG CCGGGGGGCG GTACACGGCC GGGGCACGGC ACGAGGGCGG GGCGCTCCAC
ACGGCCGGGA GCGGGGAACC GCGCACGGCG GGGACCGAGG CGCGGCACAC GGCCGGGGCA
CGGCGCGCAG CCGCGTCCGA CGGTGCCGGT CCCGGCCTGC CGTCCCCCGC GCGGCGCGTG
GTGTTCCTGG ACGTCGCCCG GGCGCTCGCG ATGATCGGCG TGGTCATGAT GAACGCGTCC
TCGGTGGTCT ATCCCGTCGA GATGTCGGGC GACCGGGTCC CGGGCGTGCT GAGCGAACTC
GTGGACGGCG GGCTCACCCT GCTCATGTCG GGCAGGGCCA GGACCATGCT CATGGTGCTG
CTGGGCGCGG GCGTGGTCCT GGCGTGGCGG GCCGCCGCCC GGCGCGGCGG GAGTCCGGCG
GCGGTGATGC TGCGCCGCTA CGCCGTCCTG GGCCTGCTGT TCGGACTTCC GCACCTGGCG
GTCTTCGACG GGGACATCCT CACCCAGTAC TCGGTCGCGG CGCTGCTGCT GACCCCGCTG
GTGCCGCTGC TGCTCGGCGG GTCCCGCCGC AGACCGCTGG TCGCGGCGGC CGTGCTGTTC
GCCGCCGTGC CGGTGTCGGA CCTGCTGCTC TCGCCGTTCC TCGGGGACCA CACCTGGGGG
GCCTCGGCGA TGCTGGTCCC GCAGACCCTC GGGTTCTTCT GCGTGGGCGT GTGGCTGGCC
CGCCGCCCGG AGCTCACCGC CGAGCCCGGA ACCGGTGCGG AGGGGACCTC CCGCCTGCCG
CTGCGCATGC TCGTGTTCGG CGCGGTGGTC CAGGTCCTGA GCGTGGCCCT CATGCTCGTC
GGCAGCGTCG TGTTCCCGAC CGAGTTCGGC GCCGACGGCG CGCCGGTGCG GTCCGTGGGC
GAGACCGTGG TCGTCCTCCT GGGGAACACC TTGCTGAACC TGGGCGGAGC CCTGCTCTAC
CTGGGACTGG TGTGGTGGCT GGTACTCAGG GGACGGGGCG CGGCGCGCGT GCTGGGGACC
CTGGCCCCGC TCGGGCGGAT GACCCTCACG GTGTACCTGG GCAGCACGGC GGTGTTCCTG
GCGGTCATGG GCCCGTTCGA GGGGACGGTC CCCCAGCTGG CCCAGTACGC CCTGGCCGCC
GCCTACTTCG TCGCGACCGC CGTCCTGGCC CACCTGTGGG CGCGCCGGTT CCGGCTCGGT
CCGCTGGAGT GGGTGTGGCG GAGCCTGACC CACCTGCGCC CGGTGCCCCT GCGCGCCGAG
CGCTCCGGTC GGCGGTCCGG CCTCCCGGCC GCCAGCGGGG ACCCGGCCTG A
 
Protein sequence
MSEAGYDTTG AQRHGDGTHH PVGAGGRYTA GARHEGGALH TAGSGEPRTA GTEARHTAGA 
RRAAASDGAG PGLPSPARRV VFLDVARALA MIGVVMMNAS SVVYPVEMSG DRVPGVLSEL
VDGGLTLLMS GRARTMLMVL LGAGVVLAWR AAARRGGSPA AVMLRRYAVL GLLFGLPHLA
VFDGDILTQY SVAALLLTPL VPLLLGGSRR RPLVAAAVLF AAVPVSDLLL SPFLGDHTWG
ASAMLVPQTL GFFCVGVWLA RRPELTAEPG TGAEGTSRLP LRMLVFGAVV QVLSVALMLV
GSVVFPTEFG ADGAPVRSVG ETVVVLLGNT LLNLGGALLY LGLVWWLVLR GRGAARVLGT
LAPLGRMTLT VYLGSTAVFL AVMGPFEGTV PQLAQYALAA AYFVATAVLA HLWARRFRLG
PLEWVWRSLT HLRPVPLRAE RSGRRSGLPA ASGDPA