Gene Ndas_1063 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_1063 
Symbol 
ID9244909 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp1309369 
End bp1310787 
Gene Length1419 bp 
Protein Length472 aa 
Translation table11 
GC content77% 
IMG OID 
Productprotein of unknown function DUF405 
Protein accessionYP_003679011 
Protein GI297560037 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.397871 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCGGAT CCACGGACGA AGAGGAACAC GGCGGCGCCG GGGGCGCGGC ACGGTACCGG 
AGGCGACAGG ACGGCGACGG CCCCGGCGGC GCGGGACGCG CCCCGCGCGG CGGCGTCCTG
CCGGGTGAGC GCGCGCTGGC CCCGGACCTG GCGCGCGGGA TGATGCTGCT GCTCATCGTG
CTGTCCAACA CCGCGTTCCA CCTGTGGGCG GCCCAGCACG GACCCTCCGG CTGGCACCCG
GTGGACGGCT CGTGGCTGGA CCGGGCGGTC CGGTTCGCCA TGATCGTCGG CCTGGACCTG
CGCGCCTATC CCCTGTTCGC CTTCCTGTTC GGCTACGGGA TGATGCAGCT GTTCCTGCGC
CAGAGGGCGC ACGGGACCTC CGAACGCACG GCCGTGGCGC TGCTGCGCCG ACGCGGTCTG
TGGCTGGTCG TGTTCGGGTT CGCGCACGCC GCCCTGCTCA TGGCGGGCGA CATCATCGGC
TCCTACGGGG TGGCGAGCCT AGTGCTGGTG TGGCTGTTCA TCCGGCGCGG GGACCGGGTC
CTGCTGGCGG GGGCGGCGGT GTTCGCCCTG CTCATCGCGG TTCCCGCCGC ACAGGCGGCG
TGGAGCCTGG CGGCGCACGG GCTGGAGGGC GTCGGCGGCG CGGGGGCCGA GCCGTCCTAC
CTGGCCTACG CCGCCCAGGA GGAGGACCCG CTGGCGGCGG CGGGCACACG GCTGCTCACC
TGGGCGTTCG TCACCCTGGC GGGCGGCCTG CTGTCCTTCG GCGGGTTCGC CATGATCCTG
CTGGGCTTCT GGGCGGCGCG GCGGCGCGTC CTGGAGGAGC CGCACCGCCA CCTGCGGCTG
CTGGGCTGGA CGGCGGCGGT GGGGATACCG GTCGGCTGGC TGGGCGGGCT GCCGTCGGCG
CTGGTCCACG CGGGGCTGGT GGGGGCCGCC CCGGGCGCGG TCGGGGACGG GGGACCGCTG
CCCGCGGTCC AGGCCGCCAC GGGGCTGGCC TGCGGGCTGG GCTACGTCGC GCTGTTCGCC
CTGGCCGCGC ATGGACTGTC GCGACGGGGC GGCGTCGGCC GGACCGGCGC GGGCCGGGCG
GTGACCGCGG TCGCGGCGCT GGGCAGGCGC TCCATGTCGG GTTACATGGC CCACTCGCTG
CTCTTCTCCC CGCTGCTCGC CGCCTGGGGC CTGGGGCTGG GCGCCCACCT CAACAGCGCG
TCCATGGCGG CGTTCGCGCT GGCCGTGTGG CTGGTCACGG TCGTGGCGGC CTACGCCCTG
GAGCGCGCCG GCCGCCCCGG CCCGGCCCGG CCGAGTATCT GCTCCGTCGG CTCATCTACC
GCCGGACCCG CCCGGGGCGG CGGTAGACGC GCCCTCCCGG TTCCGGAACC CGGCCAGCTG
GTCGCGCAGG ACCCCGTAGG TGGGGGCCAG GTCCCGTAG
 
Protein sequence
MRGSTDEEEH GGAGGAARYR RRQDGDGPGG AGRAPRGGVL PGERALAPDL ARGMMLLLIV 
LSNTAFHLWA AQHGPSGWHP VDGSWLDRAV RFAMIVGLDL RAYPLFAFLF GYGMMQLFLR
QRAHGTSERT AVALLRRRGL WLVVFGFAHA ALLMAGDIIG SYGVASLVLV WLFIRRGDRV
LLAGAAVFAL LIAVPAAQAA WSLAAHGLEG VGGAGAEPSY LAYAAQEEDP LAAAGTRLLT
WAFVTLAGGL LSFGGFAMIL LGFWAARRRV LEEPHRHLRL LGWTAAVGIP VGWLGGLPSA
LVHAGLVGAA PGAVGDGGPL PAVQAATGLA CGLGYVALFA LAAHGLSRRG GVGRTGAGRA
VTAVAALGRR SMSGYMAHSL LFSPLLAAWG LGLGAHLNSA SMAAFALAVW LVTVVAAYAL
ERAGRPGPAR PSICSVGSST AGPARGGGRR ALPVPEPGQL VAQDPVGGGQ VP