Gene Haur_3452 details


Gene Information

Locus tag: Haur_3452
Symbol:
ID: 5735313
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 4340323
End bp: 4341567
Gene length: 1245 bp
Protein length: 414 aa
Translation table: 11
GC content: 52%
IMG OID: 641280599
Product: nuclease SbcCD, D subunit
Protein accession: YP_001546216
Protein GI: 159899969
COG category: [L] Replication, recombination and repair
COG ID: [COG0420] DNA repair exonuclease
TIGRFAM ID: [TIGR00619] exonuclease SbcD
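
The coordinate and length fields above are related by simple arithmetic, which the following minimal Python sketch makes explicit. It assumes the start and end positions are 1-based and inclusive and that the gene length includes one terminal stop codon; the function names are illustrative and are not part of any database API.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Amino acid count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(4340323, 4341567)
    print(length)                  # 1245
    print(protein_length(length))  # 414
```

Under these assumptions the record is internally consistent: the coordinates span 1245 bp, and 1245 / 3 = 415 codons, one of which is the stop codon, leaving 414 residues.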


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.527539
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAATTC TTCACCTCGC AGATATTCAC ATTGGCATGG AAAATTATGG CCGAATCGAT 
AGCACAACTG GCCTCAACAC CCGCTTGATC GATTATCTTG ATCGGTTTGC CGAGGCTTTG
CAGATTGGCA TCGAGCATGA TGTCGATTTG GTGCTGATTG CTGGCGATAT TTACAAAAAC
CGCACGCCCA ACCCAACCCA TCAACGTGAA TTTGCTCGTC GCCTGCGCAG TGTGCTCGAT
CGGGGCATTC CCGTATTTAT GTTGGTTGGC AATCACGATG TTTCAGCCGC CGCAGGCAAA
GCTCATTCGG TCGAAATTTT CGATACCCTC GCCATCGATG GGGTAACAAT TGCCGATCGG
CTTGGGATTC ATACGATCGA AACTCGCGCC GGTAGCATTC AAATTGTGGC CGTGCCATGG
ATCAGCCGCC ACGCCATTTT GACCAAAGAT GATATTCGTG AGTTGCCATT TGCTGAACTC
GAAGCTGAAT TATTGCGGCG GGTAGGGGCT TGGCTCGAAC AAGTGCCCGA GCGGTTGCGC
GGCGATTTAC CAGCGATCTT GACCTTTCAT GGCACTGTTT CCAATGCCAC CTATGGCGCT
GAACGCTCGG TCATGTTGGG CAATGATCTG ATTCTGCCGC CATCACTTTT GGCCCAGCCA
GGCATTCAAT ATGTCGCCTT GGGCCATATT CACCGCTATC AAGTGCTCAG CGAAAATCCC
CCAATGATCT ACCCTGGCTC GATTGAGCGC ATCGATTTTA GCGAGGAATC TGAGCAAAAA
CAAGTGGTAA TTGTTGAAAT TGAAAATAAT TGGGAAGATG CCAGCTATCA GCCAATTGCG
GTGCATCCGC GCCCATTCGT CACGATCAAA GTTGATGTAA CTGGCAGCAG CGACCCCATG
GAGCGGGTGG CCCAAGCGAT TAGCAAGCGC GATTTAAATG GCGCGGTGGT GCGTTTGTTA
ATTAGCGCTA CTGCTGAGCA ACGCCCACAG CTTGATGAGA CTGAACTGCG ACGCTTGCTC
GAGGCTGCCG AAACCCATGT GATCGCCAGC ATTGCGATTG AGGCCCAACG CAGCGAACGC
ACTCGCTATG CTGCGGTTGC CAGCGAATTG AATGAAGGAT TAACTCCACG CCGCGCCCTC
GAAATCTACC TTGAAAGCAG CAATATCAGC GCCACTCGCC GCGAACAGAT GCTCAAAGCC
GCCGATGACT TGATCAAAGC CGAACAAGCA CGTGAGCAAG CATGA
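
The gene sequence above is printed in blocks of 10 bases, so it has to be joined before any computation. As a rough sketch, the GC content reported in the record could be recomputed as follows. The helper names `clean_sequence` and `gc_content` are hypothetical, and the record's value is presumably rounded, so an exact match to the computed figure should not be expected.

```python
def clean_sequence(text):
    """Join a block-formatted sequence into one uppercase string.

    Removes the spaces between 10-base blocks and any line breaks.
    """
    return "".join(text.split()).upper()


def gc_content(text):
    """Percentage of G and C bases in a (possibly block-formatted) sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


if __name__ == "__main__":
    # First two blocks of the gene sequence, as printed in the record.
    first_blocks = "ATGAAAATTC TTCACCTCGC"
    print(clean_sequence(first_blocks))
    print(gc_content(first_blocks))
```

Passing the full gene sequence text to `gc_content` gives the value to compare against the GC content field.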
 
Protein sequence
MKILHLADIH IGMENYGRID STTGLNTRLI DYLDRFAEAL QIGIEHDVDL VLIAGDIYKN 
RTPNPTHQRE FARRLRSVLD RGIPVFMLVG NHDVSAAAGK AHSVEIFDTL AIDGVTIADR
LGIHTIETRA GSIQIVAVPW ISRHAILTKD DIRELPFAEL EAELLRRVGA WLEQVPERLR
GDLPAILTFH GTVSNATYGA ERSVMLGNDL ILPPSLLAQP GIQYVALGHI HRYQVLSENP
PMIYPGSIER IDFSEESEQK QVVIVEIENN WEDASYQPIA VHPRPFVTIK VDVTGSSDPM
ERVAQAISKR DLNGAVVRLL ISATAEQRPQ LDETELRRLL EAAETHVIAS IAIEAQRSER
TRYAAVASEL NEGLTPRRAL EIYLESSNIS ATRREQMLKA ADDLIKAEQA REQA
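
The protein sequence follows from the gene sequence through translation table 11 (the bacterial, archaeal and plant plastid code), which assigns the same amino acids to codons as the standard code and differs only in the set of permitted start codons. A minimal Python sketch of that translation is shown below. It assumes the sequence is given in the coding orientation, starts in frame, and contains only A, C, G and T; the `translate` function is illustrative and not taken from any particular library.

```python
BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
# Table 11 uses the same codon-to-amino-acid assignments as the standard code.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(text):
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()  # drop block spacing and newlines
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":  # stop codon ends the protein
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First three 10-base blocks of the gene sequence in this record.
    print(translate("ATGAAAATTC TTCACCTCGC AGATATTCAC"))  # MKILHLADIH
```

The output for the first 30 bases matches the first block of the protein sequence, and the final codon of the gene sequence, TGA, is a stop codon, which is why the 1245 bp gene yields 414 residues.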