Gene Ndas_0835 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_0835 
Symbol 
ID9244680 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp1026738 
End bp1027832 
Gene Length1095 bp 
Protein Length364 aa 
Translation table11 
GC content77% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003678785 
Protein GI297559811 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.788787 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0618655 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGCCA CCGGCCCCGC CTTCGAGGAC GCCGAGCGCC AGGTCGCCCT GGCGGGCGAC 
CCCGTGGAAC TGTTCGGCCC GCTGCCCGAG GACGGCGGCG TGCCCCCGGC GGCCACGGCC
CGCCACCGCG AGCTGGCCCG CGCGCTGCAC CCCGACACGG CGCCGCCCGG AAGCGGCACC
GGCCCCTTCG CCCGGCTCTC GGAGCTGTGG GACCTGTACC GGGCGATGGC CGCGGGCGAC
CTGCGTCTGG ACGACCTCAC CCTGGCCACC GGCGCGCACA CCTACCGGAT CGGCCGGGAG
CGGCTGGCCC GCGGAGACGT CGCCGACCTC CACCCGGTGC GCTACCGGGC GCCGGAGTGG
CGCGACGCGG TGCTCAAACT GCCCCGCGCA CCCCGCGACA ACGACCTGCT GGAGGCCGAG
GCGACCGCGC TGCGCCGCAT CCGGGAGCAC GGCCACGAGC GCTACCGGGC CTTCGTCCCC
GAACTGGTGG AGTCCTTCAA GCACCGCGAC GCCGCCACCG GCGTGGAGCG GCGGGCCAAC
GTCCTGGGGC GGCTGCACGG CTTCCACACG CTGGCCGAGG TGCGCCGCGC CCACCCCGAC
GGCGTCGACC CGCGCGACGC GGCGTGGATG TGGCGGCGGC TGCTGGTCGC CGTCGGCAAC
GCCGCCCTGG CGGGGGTCGT GCACGGCGCG GTCGTGCCCG AGCACGTGAT GATCCACCCG
GCCGAGCACG GCCTGGTCCT GGTCGACTGG TGCTACTCGG TGACGGCGCA CGCCCCGCGC
ACCGCGCCGC ACATCCCGGC GATGGTGCCC GGACGCGCGG ACTTCTACCC GCCCGAGGTG
GCCGCCCGCC GCCCCGCGCT GGCCCAGACC GACATCCACA TGGCGACCCG GTGCGTGGAG
TACGTCACCG CGGGCCGCCT GCCCCCGCAG CTGCGTTCCT TCGCGCGCGG CTGCACCCTG
CCCGCCCCAG AGCGGCGGCC CCGCGACGGG TTCGCCCTGC TCTGCGAACT GGACGACGTG
CTGGAACGCC TCTACGGGCC GCGCCGGTTC CGCCCCTTCA CCATGCCGGA CCCGGCACCG
GCCGCCGAGG TCTGA
 
Protein sequence
MTATGPAFED AERQVALAGD PVELFGPLPE DGGVPPAATA RHRELARALH PDTAPPGSGT 
GPFARLSELW DLYRAMAAGD LRLDDLTLAT GAHTYRIGRE RLARGDVADL HPVRYRAPEW
RDAVLKLPRA PRDNDLLEAE ATALRRIREH GHERYRAFVP ELVESFKHRD AATGVERRAN
VLGRLHGFHT LAEVRRAHPD GVDPRDAAWM WRRLLVAVGN AALAGVVHGA VVPEHVMIHP
AEHGLVLVDW CYSVTAHAPR TAPHIPAMVP GRADFYPPEV AARRPALAQT DIHMATRCVE
YVTAGRLPPQ LRSFARGCTL PAPERRPRDG FALLCELDDV LERLYGPRRF RPFTMPDPAP
AAEV