Gene Ndas_2278 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_2278 
Symbol 
ID9246128 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp2726561 
End bp2727883 
Gene Length1323 bp 
Protein Length440 aa 
Translation table11 
GC content69% 
IMG OID 
ProductHNH endonuclease 
Protein accessionYP_003680206 
Protein GI297561232 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.384645 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.214719 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCGCAG CACCCAAAGT GGCCCCCGGG GAATGTTCCC CGGCCGTGGC CGCGCTGGCC 
GGTGCGCGCG AGATCATCCA CCAAGCGTTG AACGCCGAAG TCCCGCCCGG GGCCGACGAG
ACCGCCGCGG AGGAGATCGC GGCGCTGTGG GCGCAGCTGG ACCAGATCCG GTATCAGGCG
TTGACGCAGA TGGCGCGCCT GTACGCACGT GGTGAGGTCG CCCGTTACAG CGGCTACAGC
ACGTTGGACA AGTGGATCAC CCATCGCTGC AAGGTCCCCA CTGCCCAGGC CAAGGATTTG
GCCCGTTTGG CCCAGCACGT GCAGGAGGAG ACGTTGCCCG CCACTGCCCA AGCAGTAGCT
GAGGGCAGGG TGGTGTTGGG TGAGGCGGTC GCGATCGCCA AAGCCACCGA CAAAGCCGTC
CAGACCCGCG ATGAGGACCA CTTTCCCGAT GAGGGGGAGT ACCGGCAGGG GTTCGAAGCG
GCTCTGGTGG CGGCCAAGGC GGAGCGGCCC GCGTTGTCGG TCAACCAGCT CCAGTCGGTG
GCCCGCCAGG TCGCCTACCG TTTGGACCCC CACCGCCTGG ACCGCGACCA CGAGGCCGCC
CATGCCGTCC GCGGGCTGGT AGTGCATGAC ACGTTCCAGG GCAGCTACCA ACTCCAGGCC
TGGGGTGGGT CCGGGGATGC GTTGGTCGTG CGCGCGGCCA TCGACACCTT CGACGTTTCG
CACTCGGATG AGGACACGCG CAGTCGTTCG CAGCGTGAGC ACGACGCGCT CATCGCGGCG
CTGCGTTTTG CCACCACCCA CACCGGATGC GGCAACGCTC CGGCTCCGTT GGCGCAGATC
CGCATCGTGG TGCCCGTGCA GACCTACCTG GACGCCCAAG GCCAGGAGGT TCCGGCGTTG
GACGAGCACG GTCGGGTGAT CCCAGCCGGT GTGGTCCACG AACTGGCCGC CGATTCCGAG
GTGGTGCGGA TGCTCACCGC ACCCCCCACC GGACAGGTGC TGGACGTGGG CCACAGCCGC
CGCCTGGCCT CAACCCGCCA ACGCACCGCC GCCTTCCACG GACACGCCAC CTGCGCCCAC
CCGGGCGGAT GTGAGGTACC GGTGGCGTTG TGCCAGGCCG ACCACGTCAC CTCGTTCTCC
CGGGGCGGGC GCACGGTGGT GGCCAACCTG CAACCGTTGT GCGGGCCGCA CAACCGGGCC
AAGTACCAAC GCGAACTCCG CACACACCAC CACCAGGGAC AAGGACAAGG GCGGGGATGG
GGACGAGGAC GGGGAGGAGA CCACCCACCC GGCAGGGATC CGGATCCACC ACCCCGGAAA
TGA
 
Protein sequence
MIAAPKVAPG ECSPAVAALA GAREIIHQAL NAEVPPGADE TAAEEIAALW AQLDQIRYQA 
LTQMARLYAR GEVARYSGYS TLDKWITHRC KVPTAQAKDL ARLAQHVQEE TLPATAQAVA
EGRVVLGEAV AIAKATDKAV QTRDEDHFPD EGEYRQGFEA ALVAAKAERP ALSVNQLQSV
ARQVAYRLDP HRLDRDHEAA HAVRGLVVHD TFQGSYQLQA WGGSGDALVV RAAIDTFDVS
HSDEDTRSRS QREHDALIAA LRFATTHTGC GNAPAPLAQI RIVVPVQTYL DAQGQEVPAL
DEHGRVIPAG VVHELAADSE VVRMLTAPPT GQVLDVGHSR RLASTRQRTA AFHGHATCAH
PGGCEVPVAL CQADHVTSFS RGGRTVVANL QPLCGPHNRA KYQRELRTHH HQGQGQGRGW
GRGRGGDHPP GRDPDPPPRK