Gene Ndas_3201 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3201 
Symbol 
ID9247058 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp3829056 
End bp3830342 
Gene Length1287 bp 
Protein Length428 aa 
Translation table11 
GC content69% 
IMG OID 
ProductHNH endonuclease 
Protein accessionYP_003681115 
Protein GI297562141 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCGCAG CACCCAAAGT GGCCCCCGGG GAATGTTCCC CGGCCGTGGC CGCGCTGGCC 
GGTGCGCGCG AGATCATCCA CCAAGCGTTG AACGCCGAAG TCCCGCCCGG GGCCGACGAG
ACCGCCGCGG AGGAGATCGC GGCGCTGTGG GCGCAGCTGG ACCAGATCCG GTATCAGGCG
TTGACGCAGA TGGCGCGCCT GTACGCGCGC GGTGAGGTCG CCCGCTACAG CGGCTACTCC
ACGCTGGACA AGTGGTTGGT GCACCACGGC ACGATGCCCA CCGCTCAGGC CAAGGACCTG
GCTCGTCTGG CCCAGCACGT GCACGAGGAG ACACTGCCCG CCACTGCCCA AGCAGTAGCT
GAGGGCAGGG TGGTGTTGGG TGAGGCGGTC GCGATCGCCA AAGCCACCGA CAAAGCCGTC
CAGACCCGCG ACGCCCAGCA TTTCCCCGAT CAGGGGGAGT ACCGGCACGG GTTTGAGTCC
GCCCTGGTGG CCGCGAAGGC GGAGCGGCCC GCGTTGTCGG TCAACCAGCT CCAGTCGGTG
GCCCGCCAGG TCGCCTACCG TTTGGACCCC CACCGCCTGG ACCGCGACCA CGAGACCGCC
CATGCCGCCC GCGGACTGAC GGTGCATGAC ACGTTCCAGG GCAGCTACCA ACTCCAGGCC
TGGGGCGGCA GTGGGGATGC GTTGGTCGTG CGCGCGGCCA TCGACACCTT CACCACCCCA
CCCGGGGAAG GTGACACCCG GTCCCGGTCC CAGCGTGAGC ACGACGCGCT CATCGCGGCG
CTGCGTTTTG CCACCACCCA CACCGGATGC GGCAACGCTC CGGCTCCGTT GGCGCAGATC
CGCATCGTGG TGCCCGTGCA GACCTACCTG GACGCCCAAG GCCAGGAGGT TCCGGCGTTG
GACGAGCACG GTCGAGTGAT TCCGGTCGGG CTGGTGCACG AGTTGGCCGC CGATTCTGAG
GTGGTGCGGA TGCTCACCGC ACCGCCCACC GGGCAGGTGT TGGATGTGGG CCACAGCCGC
CGCCTGGCCT CAACCCGCCA ACGCACCGCC GCCTTCCACG GACACGCCAC CTGCGCGCAC
CCGGGCGGAT GTGAGGTACC GGTGGCGTTG TGCCAGGCCG ACCACGTCAC CTCGTTCTCC
CGAGGAGGGC GCACCGTGGT CGCCAATCTC CAACCGTTGT GCGGGCCGCA CAACCGGGCC
AAGTACCAAC GCGAACTGCG CACACACCGA CAGCGGGAAC GGCATCATCC GCCCGACAGG
ATTCCGGTTC CACCGCCCCG GGAATGA
 
Protein sequence
MIAAPKVAPG ECSPAVAALA GAREIIHQAL NAEVPPGADE TAAEEIAALW AQLDQIRYQA 
LTQMARLYAR GEVARYSGYS TLDKWLVHHG TMPTAQAKDL ARLAQHVHEE TLPATAQAVA
EGRVVLGEAV AIAKATDKAV QTRDAQHFPD QGEYRHGFES ALVAAKAERP ALSVNQLQSV
ARQVAYRLDP HRLDRDHETA HAARGLTVHD TFQGSYQLQA WGGSGDALVV RAAIDTFTTP
PGEGDTRSRS QREHDALIAA LRFATTHTGC GNAPAPLAQI RIVVPVQTYL DAQGQEVPAL
DEHGRVIPVG LVHELAADSE VVRMLTAPPT GQVLDVGHSR RLASTRQRTA AFHGHATCAH
PGGCEVPVAL CQADHVTSFS RGGRTVVANL QPLCGPHNRA KYQRELRTHR QRERHHPPDR
IPVPPPRE