Gene Haur_0419 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_0419 
Symbol 
ID5732318 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp488839 
End bp491997 
Gene Length3159 bp 
Protein Length1052 aa 
Translation table11 
GC content51% 
IMG OID641277545 
Productendonuclease/exonuclease/phosphatase 
Protein accessionYP_001543198 
Protein GI159896951 
COG category[R] General function prediction only 
COG ID[COG2374] Predicted extracellular nuclease 
TIGRFAM ID[TIGR01451] conserved repeat domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0131268 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCAATTTA AGGGTTTACG ACTCCTAACA GCGTTAACGA TGCTGTTCAC GTTGATTGGT 
ACTTCGTTGT CCGCAACATA CACCACTCCA ACCAGTGCTG TATCAACCTC CCTTGTCATT
AGTCAATTCC AAACCGCTGG CGGTACTGCC GATGATGAAT TTATCGAAAT TCACAATATC
AGTGCCAATC CAGTCGATTT GAATGGGCAT CGCGTCGTTT ATCGTGCTGC TGCTGGGGTA
ACTGATGTTT CAATTGCCAG TTGGACAAGC TCGGTTGTAA TTCCTGCTGG TGGTTTCTAT
CTCTTAGGCC GTGCGAACTC ATATGATGGT ACGGTTACCG CTGATACAAC CTTTGGTAGC
GGTATTTCTG GTACGGGTGG GGGCTTCGCT ATCCGTAACG GTGATTTGAA TACTGGCACA
ATCATCGATT CAGTTGGCTT TGGTAGTGCG ACGAATGCCT TTGTTGAAGG CACGGTTGTA
GCTGCTCCGA GCGCCAACAA TAGTGCTCAA CGCAATAATT CAAGTTGTAC TGATACCGAC
AATAATGCTG CCGATTTCAG TTTGTTGACT CCTTCAGCTC CTCGCAATAG CAGCAGTCCA
GTTGCAACCT GTGGTGGTTT GCCAACCGAT AATCCACCAA CTGTTGCCAG CACTGTGCCA
GCCAATAATG CCAACAATGT GGCCTTGAAT GCGAACGTCA CCCTTACGTT CAGCGAGCCA
GTCACCACGA CGGGCACTTG GTACACCATT AGCTGTACCA GTGGCGCTCG CACCGCCAGT
GTTACTGGTG GCCCAACCAG CTATGTGCTT GATCCCAGCA GCGATTTTGC TGCCAGCGAT
GCTTGTACGG TAACGGTTGT TGCCAGCCAA GTAACCGACC GCGATGGTAG CGTTGATGCT
ATGGCCCAAG ACTATAGCTT CAACTTCAAT GTTTCGAGCG GTGTCGCCTG TTCAGGTGGC
ACGGTTACCC CAATCGGCCA AGTGCAAGGC ACTGGCGAAA CCAGCCCAGC AAACGGTACG
GTTGTTACCG TTCAAGGGAC GGTGGTAGCC GATTTTGAAG GTGCTCAGCC TGCTTTGCGT
GGCTTCTACC TGCAAGATGC TGGCGATAGC AACGCTGCAA CTTCCGATGG TATTTTTGTA
TTCAACGGCG GCGCTGATCA AGTTGCGCTT GGTCAAGTGG TGCGCGTAAC TGGTACGGCT
GGCGAAAACC AAGGCCAAAC CCAATTGAGC GGAAGCTTGA CGGTTGAGCT TTGTGGCACA
ACCAACACGG TTACCCCAAC CCAAGTTACC TTGCCATTTG CCTCAACCAC CGATGCCGAG
CGCTTTGAAG GTATGTTGGT TCAATTCAAC CAAAAGCTGG TTGTTTCAGA GCACTACTTG
TTGGCTCGCT TCGGCCAAGT AACCTTATCG CTCAACGAAC GCTTGGCACA ACCAACCAAC
GTGGTTTCAC CAGGCGCTCC AGCCTTGGCA TTGCAAGCAC AAAATAACTT GCACAAAATC
ATCCTCGATG ATCCATTCAA CAACCAAAAT GCCGACCCAA TTCTGTTTGG TCGTGGTGGC
ACTGGTTTGA GCGCCGCCAA TACCCTTCGC GCTAGCGATA GCATCACTGG GTTGCAAGGT
GTGATGACCT ACACGTGGGG TGGTAATAGC GCTAGCCCCA ACAGCTACCG CGTGCGCGTA
ACCCAATCGC CAAACTTCGT AGCCGAAAAC CAACGCCCAA CCTCGCTTGA GCTTGATGGC
TCATTACGGG TTGCGGGAAT CAACGTGTTG AACTATTTCA ACACCTTTGG CAATGGCAAC
TGTACTGGCG GGGTTGGTGG TGCTGCTACC GATTGTCGTG GTGCTGAAAA TACCACCGAA
TTCCAACGCC AAAAAGATAA AACCATTGCT GCGATCATCA TGACTCAAGC TGATGTGATT
GGCTTGATGG AAATTGAAAA TGATGGCTTC GGCAGCAATA GCGCAATCCA AGATTTGGTT
AATGGCTTGA ACGCAGCCAC TGCACCTGGT ACCTATGCCT TGGTGGATGT CGATACACGC
ACAGGCCAAA CCAATCCATT AGGTACTGAT GCAATTAAGG TTGCCTTACT CTACAAGCCT
GGCAAGGTTA GCTTGGCTGG TACAACCGCC GTACTCAATA CTGGGGCATT CGGCGATTTC
ACCACCAGCA GCGGCACAAC TGGCCGTAAT CGCCCAGCTT TGACTCAATC GTTTACCGAA
ACCAGCAGCG GCGAAACCTT CACCGTGGTG GTCAATCACC TTAAGTCGAA GGGTAGTGCT
TGTACCGATA ACGTCAGCCC AGTGCCAAGC GACCCCGATT TGGGCGATGG TCAAGGCAAC
TGTAACCTGA CCCGCAAAAC TGCTGCCCAA GAATTAGTGG CTTGGTTGAA TACCGACCCA
ACCAATGTTG ATGATAGCGA TTATCTGATC ATTGGCGACT TGAACTCATA CGCCATGGAA
GATCCAATCA CCGCTATTCG CAACGCTGGC TATGTCAATA TTCCCAAAAC CTTGCTTGGC
GACGAAGCCT ACTCATACAT TTTCGATGGC CAAACTGGTT CGCTTGACCA TGCCTTGGCT
AGCACTTCGT TGTTCAGCCA AGTTGCTGAT GTTCAAGAAT TGCACATCAA TGCCGATGAA
CCAATCGCCT TGGATTACAA CACCAACTTC AAGAGCGCTG GGCAAATCGT CAGCTTGTAC
AATAACGATG CCTATCGTGC TTCCGACCAC GATCCAATTG TGGTTGGTTT AGCCTTAGGC
GCGGCTCCAA CCGCCAACTT CAGCACTTCA ACCAAAACCG TTAGCAGCAC CAGCGTCGAA
CAAGGCGAGG TCTTCACTTA CACCTTGACG ATTCGCAACA CGGGTGCAGC TGAAGGCAGC
TTTAGCTTGA GCGACCTGAT CAACAGCAAT CTCGAAATTG TTGGCGTGAC TGGCGGCTTG
ACGGTCAATG ACCAAACCGC CAGCGTCAAT AGCAGCCTCG CACCAAGCAG TAACAAAACC
TATACCATCG CTGTGCGCAG CGTGGGCAAT TTCGTGGGTA GCGTTGGCAA CACCGCTGTC
CTCAACAGCA ATATTAACTT GACCGCCGAT AGCGTGACAG TCAATGCAAC GACAACCCCA
AGCTTCACTG TGTACATGCC ATTAGTCACC AAAAACTAA
 
Protein sequence
MQFKGLRLLT ALTMLFTLIG TSLSATYTTP TSAVSTSLVI SQFQTAGGTA DDEFIEIHNI 
SANPVDLNGH RVVYRAAAGV TDVSIASWTS SVVIPAGGFY LLGRANSYDG TVTADTTFGS
GISGTGGGFA IRNGDLNTGT IIDSVGFGSA TNAFVEGTVV AAPSANNSAQ RNNSSCTDTD
NNAADFSLLT PSAPRNSSSP VATCGGLPTD NPPTVASTVP ANNANNVALN ANVTLTFSEP
VTTTGTWYTI SCTSGARTAS VTGGPTSYVL DPSSDFAASD ACTVTVVASQ VTDRDGSVDA
MAQDYSFNFN VSSGVACSGG TVTPIGQVQG TGETSPANGT VVTVQGTVVA DFEGAQPALR
GFYLQDAGDS NAATSDGIFV FNGGADQVAL GQVVRVTGTA GENQGQTQLS GSLTVELCGT
TNTVTPTQVT LPFASTTDAE RFEGMLVQFN QKLVVSEHYL LARFGQVTLS LNERLAQPTN
VVSPGAPALA LQAQNNLHKI ILDDPFNNQN ADPILFGRGG TGLSAANTLR ASDSITGLQG
VMTYTWGGNS ASPNSYRVRV TQSPNFVAEN QRPTSLELDG SLRVAGINVL NYFNTFGNGN
CTGGVGGAAT DCRGAENTTE FQRQKDKTIA AIIMTQADVI GLMEIENDGF GSNSAIQDLV
NGLNAATAPG TYALVDVDTR TGQTNPLGTD AIKVALLYKP GKVSLAGTTA VLNTGAFGDF
TTSSGTTGRN RPALTQSFTE TSSGETFTVV VNHLKSKGSA CTDNVSPVPS DPDLGDGQGN
CNLTRKTAAQ ELVAWLNTDP TNVDDSDYLI IGDLNSYAME DPITAIRNAG YVNIPKTLLG
DEAYSYIFDG QTGSLDHALA STSLFSQVAD VQELHINADE PIALDYNTNF KSAGQIVSLY
NNDAYRASDH DPIVVGLALG AAPTANFSTS TKTVSSTSVE QGEVFTYTLT IRNTGAAEGS
FSLSDLINSN LEIVGVTGGL TVNDQTASVN SSLAPSSNKT YTIAVRSVGN FVGSVGNTAV
LNSNINLTAD SVTVNATTTP SFTVYMPLVT KN