Gene Haur_2671 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_2671 
Symbol 
ID5734566 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp3424400 
End bp3425812 
Gene Length1413 bp 
Protein Length470 aa 
Translation table11 
GC content54% 
IMG OID641279813 
ProductDNA/RNA non-specific endonuclease 
Protein accessionYP_001545437 
Protein GI159899190 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG1864] DNA/RNA endonuclease G, NUC1 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0295423 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGGATC GTCGGCTGAT TACCCTGCTC GCGTTGTTCG TCCTTGGGTT TGTTGGGGCT 
AGCTTGGTAC AACCAACGCA GGCCAAAACC GTTAGTAGTG ATAACTTGGT CTTGGGCAAT
CCCAGCGGGG CCGTCGCCAG CAGCAGCTAT CCGACCAATT ATTTAATTCA ACGCAACCAA
TATGCGCTCT CGTATCACCG CGATAATGGG ATTGCCAATT GGGTCAGTTG GCACCTCGAT
AGCGGCGATA TTGGTAGCGT TTCACGCAGC GATTTTCAAA CCGACACTAG CTTGCCCAGT
GGTTGGTATC GCGTTGCCAC TGGCGATTAC AGCGGCAGCG GCTATGATCG CGGCCATATG
ACTCCCTCAG GCGATCGCAC CGCCACCACC GCCGATAACC AAGCCACCTT CTACATGACC
AACATTATTC CCCAAGCGCC CGATAACAAC CAAGGCCCAT GGGTTGACCT CGAAACCTAT
GCTCGCGAGT TGGTCAGCGC TGGCAACGAG TTGTATATTA TCAGCGGCGG GGCTGGCTCA
CGTGGCACAA TCGCCAGTGG CAAGGTGCGA ATTCCGAATT CAACCTGGAA AATCATCGTC
GTGCTTAGCC AAGGCAGTAA CGACCTCAGC CGCGTCAGCA ACAACACCCG CGTCATCGCG
ATCAACATGC CCAATGTGCA AGGTATTCGC GATAACGATT GGCGCGATTA TCTGACCACG
GTTGATGCTC TCGAAAGCTT GACGGGCTAT AACTTCCTTT CAAATGTCTC AACCAACATC
CAAAATGTGA TTGAAGCCCG CGTCGATGGC TCGACCACAC CAATTCCGAC AGCTGTACCA
ACCTCGGGAA CCAACCCAAC CGCCACTCCA GTACGCACGG CCACCCCAAC CCCCAGCACC
GGCTGTACAT CGAGCCGCCT GTTCTTCTCA GAATATGTCG AAGGCAGCAG CAACAACAAA
GCTTTGGAAC TTTACAATAA TACTGGAGCC AGCGTCAGCC TCAGTGGCTA TAGCATTCAG
TTGTATGCCA ACGGCTCGAC CAGCGCCAGC AGTAGCGTGA ATTTGAGTGG CTCGGTCGCC
AATGGCGCAA CCTATGTGAT TGCCAACGCC TCGGCATCAA GTAGCGTACA GAATCTTGCT
AACATCACCA GCAGTGTGGC CAACTTCAAT GGCAATGATG CACTTGTGCT GACCTACAAT
GGCACGGTGG TTGATAGCTT TGGCCAAGTT GGCAACGACC CAGGTAGCAG CGGTTGGGGT
GGCACAACCA CCGATCGCAC GTTGCGCCGT AAAGCAACAA TCAGCGCAGG CGATACCAAT
CGCAGCGATA GCTTCACCCC AAGCAGCACC TGGGATAGCT ATAGCCTTGA TACATTCAGT
GGCTTGGGCA ACCACAGTGT CAGTTGCCCA TAG
 
Protein sequence
MKDRRLITLL ALFVLGFVGA SLVQPTQAKT VSSDNLVLGN PSGAVASSSY PTNYLIQRNQ 
YALSYHRDNG IANWVSWHLD SGDIGSVSRS DFQTDTSLPS GWYRVATGDY SGSGYDRGHM
TPSGDRTATT ADNQATFYMT NIIPQAPDNN QGPWVDLETY ARELVSAGNE LYIISGGAGS
RGTIASGKVR IPNSTWKIIV VLSQGSNDLS RVSNNTRVIA INMPNVQGIR DNDWRDYLTT
VDALESLTGY NFLSNVSTNI QNVIEARVDG STTPIPTAVP TSGTNPTATP VRTATPTPST
GCTSSRLFFS EYVEGSSNNK ALELYNNTGA SVSLSGYSIQ LYANGSTSAS SSVNLSGSVA
NGATYVIANA SASSSVQNLA NITSSVANFN GNDALVLTYN GTVVDSFGQV GNDPGSSGWG
GTTTDRTLRR KATISAGDTN RSDSFTPSST WDSYSLDTFS GLGNHSVSCP