Gene Ndas_1508 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_1508 
Symbol 
ID9245358 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp1847819 
End bp1849129 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content69% 
IMG OID 
ProductHNH endonuclease 
Protein accessionYP_003679444 
Protein GI297560470 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000388947 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00249472 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGAGATGA CGACCGCAGC GCCCGAGGCG GACCCCGGGG AATGTTCCCC GGCCGTGGCC 
GCGCTGGCCG GTGCGCGCGA GATCATCCAC CAGGCGTTGA ACGCCGAGGT GCCACCCGGG
GCCGACGAGG CCGCCGCTGA GGAGATCGCG GCGCTGTGGG CTCAGCTGGA CCAGATCCGG
TATCAGGCGT TGGCACAGAT GGCGCGTTTG TACGCACGTG GTGAGGTCGC CCGCTACAGC
GGCTACTCCA CGCTGGACAA GTGGATCACC CACCACTGCA AGGTCCCCAC CGCTCAGGCC
AAGGACCTGG CTCGTTTGGC CCAGCACGTG CAGGAGGAGA CACTGCCCGC CACTGCCCAA
GCAGTAGCTG AGGGCAGGGT GGTGTTGGGT GAGGCGGTCG CGATCGCCAA GGCCACTGAC
AAGGCCGTCC AGACCCGCGA TGAGGACCGC TTTCCCGATG AGGTGGAGTA TCGGCACGGG
TTTGAGTCCG CCCTGGTGGC CGCGAAGGCG GAGCGGCCCG CGTTGTCGGT CAACCAGCTC
CAGTCGGTGG CCCGCCAGGT CGCCTACCGT TTGGACCCCC ACCGCCTGGA CCGCGACCAC
GAGGCCGCCC ATGCCGTCCG CGGGCTGACG GTGCATGACA CGTTCCAGGG CAGCTACCAA
CTCCAGGCCT GGGGTGGGTC TGGGGATGCG TTGATCGTGC GCGCGGCCAT CGACACCTTC
GACGTTTCGC ACTCGGATGA GGACACCCGG TCCCGGTCCC AGCGCGAGCA TGACGCGCTC
ATCGCGGCGC TGCGTTTTGC CACCACCCAC ACCGGATGCG CCAACGCTCC GGCTCCATTG
GCGCAGATCC GCATCGTCGT GCCCGTGCAG ACCTACCTGG ACGCCCAAGG CCAAGAGGTT
CCCGCGTTGG ACGAGCACGG TCGGGTGATC CCAGCCGGTG TGGTCCACGA ACTGGCCGCC
GATTCTGAGG TGGTGCGGAT GCTCACCGCA CCCCCCACCG GACAGGTGCT GGACGTGGGC
CACAGCCGCC GCCTGGCCTC AACCCGCCAA CGCACCGCCG CCTTCCACGG ACACGCCACC
TGCGCCCACC CGGGCGGATG CGAAGTGCCA GTCGCCCTCT GCCAAGCCGA CCACGTGCAG
TCGTTCTCCC GAGGCGGACG CACCGTGGTC GCCAATCTCC AACCGTTGTG CGGGCCGCAC
AACCGGGCCA AGTACCAACG CGAACTGCGC ACACACCACC AAGGACGGCG GCGGGGACGG
GGAGGAGACC ACCCACCCGG CGGGGAGCCG GATCCACCAC CCCGGGAATG A
 
Protein sequence
MEMTTAAPEA DPGECSPAVA ALAGAREIIH QALNAEVPPG ADEAAAEEIA ALWAQLDQIR 
YQALAQMARL YARGEVARYS GYSTLDKWIT HHCKVPTAQA KDLARLAQHV QEETLPATAQ
AVAEGRVVLG EAVAIAKATD KAVQTRDEDR FPDEVEYRHG FESALVAAKA ERPALSVNQL
QSVARQVAYR LDPHRLDRDH EAAHAVRGLT VHDTFQGSYQ LQAWGGSGDA LIVRAAIDTF
DVSHSDEDTR SRSQREHDAL IAALRFATTH TGCANAPAPL AQIRIVVPVQ TYLDAQGQEV
PALDEHGRVI PAGVVHELAA DSEVVRMLTA PPTGQVLDVG HSRRLASTRQ RTAAFHGHAT
CAHPGGCEVP VALCQADHVQ SFSRGGRTVV ANLQPLCGPH NRAKYQRELR THHQGRRRGR
GGDHPPGGEP DPPPRE