Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3452 |
Symbol | |
ID | 5735313 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 4340323 |
End bp | 4341567 |
Gene Length | 1245 bp |
Protein Length | 414 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641280599 |
Product | nuclease SbcCD, D subunit |
Protein accession | YP_001546216 |
Protein GI | 159899969 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0420] DNA repair exonuclease |
TIGRFAM ID | [TIGR00619] exonuclease SbcD |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.527539 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAATTC TTCACCTCGC AGATATTCAC ATTGGCATGG AAAATTATGG CCGAATCGAT AGCACAACTG GCCTCAACAC CCGCTTGATC GATTATCTTG ATCGGTTTGC CGAGGCTTTG CAGATTGGCA TCGAGCATGA TGTCGATTTG GTGCTGATTG CTGGCGATAT TTACAAAAAC CGCACGCCCA ACCCAACCCA TCAACGTGAA TTTGCTCGTC GCCTGCGCAG TGTGCTCGAT CGGGGCATTC CCGTATTTAT GTTGGTTGGC AATCACGATG TTTCAGCCGC CGCAGGCAAA GCTCATTCGG TCGAAATTTT CGATACCCTC GCCATCGATG GGGTAACAAT TGCCGATCGG CTTGGGATTC ATACGATCGA AACTCGCGCC GGTAGCATTC AAATTGTGGC CGTGCCATGG ATCAGCCGCC ACGCCATTTT GACCAAAGAT GATATTCGTG AGTTGCCATT TGCTGAACTC GAAGCTGAAT TATTGCGGCG GGTAGGGGCT TGGCTCGAAC AAGTGCCCGA GCGGTTGCGC GGCGATTTAC CAGCGATCTT GACCTTTCAT GGCACTGTTT CCAATGCCAC CTATGGCGCT GAACGCTCGG TCATGTTGGG CAATGATCTG ATTCTGCCGC CATCACTTTT GGCCCAGCCA GGCATTCAAT ATGTCGCCTT GGGCCATATT CACCGCTATC AAGTGCTCAG CGAAAATCCC CCAATGATCT ACCCTGGCTC GATTGAGCGC ATCGATTTTA GCGAGGAATC TGAGCAAAAA CAAGTGGTAA TTGTTGAAAT TGAAAATAAT TGGGAAGATG CCAGCTATCA GCCAATTGCG GTGCATCCGC GCCCATTCGT CACGATCAAA GTTGATGTAA CTGGCAGCAG CGACCCCATG GAGCGGGTGG CCCAAGCGAT TAGCAAGCGC GATTTAAATG GCGCGGTGGT GCGTTTGTTA ATTAGCGCTA CTGCTGAGCA ACGCCCACAG CTTGATGAGA CTGAACTGCG ACGCTTGCTC GAGGCTGCCG AAACCCATGT GATCGCCAGC ATTGCGATTG AGGCCCAACG CAGCGAACGC ACTCGCTATG CTGCGGTTGC CAGCGAATTG AATGAAGGAT TAACTCCACG CCGCGCCCTC GAAATCTACC TTGAAAGCAG CAATATCAGC GCCACTCGCC GCGAACAGAT GCTCAAAGCC GCCGATGACT TGATCAAAGC CGAACAAGCA CGTGAGCAAG CATGA
|
Protein sequence | MKILHLADIH IGMENYGRID STTGLNTRLI DYLDRFAEAL QIGIEHDVDL VLIAGDIYKN RTPNPTHQRE FARRLRSVLD RGIPVFMLVG NHDVSAAAGK AHSVEIFDTL AIDGVTIADR LGIHTIETRA GSIQIVAVPW ISRHAILTKD DIRELPFAEL EAELLRRVGA WLEQVPERLR GDLPAILTFH GTVSNATYGA ERSVMLGNDL ILPPSLLAQP GIQYVALGHI HRYQVLSENP PMIYPGSIER IDFSEESEQK QVVIVEIENN WEDASYQPIA VHPRPFVTIK VDVTGSSDPM ERVAQAISKR DLNGAVVRLL ISATAEQRPQ LDETELRRLL EAAETHVIAS IAIEAQRSER TRYAAVASEL NEGLTPRRAL EIYLESSNIS ATRREQMLKA ADDLIKAEQA REQA
|
| |