Gene Haur_1107 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_1107 
Symbol 
ID5732998 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp1267546 
End bp1268670 
Gene Length1125 bp 
Protein Length374 aa 
Translation table11 
GC content52% 
IMG OID641278245 
Productradical SAM domain-containing protein 
Protein accessionYP_001543883 
Protein GI159897636 
COG category[L] Replication, recombination and repair 
COG ID[COG1533] DNA repair photolyase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.614864 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCTATT ATGTTGAAGA GCGCCCAGGT GGGCCAGCGC TAGCACCTCG TCGCCCCACA 
ATTAATGAAT TTTTTCTCTC GACCTATCAA GTTGGTCCAT ATGTGGGCTG CGAGTTTGGG
TGTGCCTATT GCGATGGCTG GTCGTTCAGT CAGCGGCCAT TTAACGAGGT TATCCGCGCT
AATGTTGATT TGCCTGATCG CTTTGCCGAG CAACTGAGCG TGGTTTCACG CGGCGATCTA
ATTGCCTTCA GCCTTGGCGA TGCCTACCAA CCTGCCGAAA AAACCTATCG CCTCACCCGC
CAGATGCTCC AGGCTTGCCA AGTTGCCAAG CAACCAGTGT TAATTTTGAC CAAAAGCTTG
GCAGTGATGG ATGATTTGAG CTTGCTGCAA CGCATGAATG AGCAGGGCTT GGCGATTGTG
GTGATGAGCA TTCCGACGAT TGATCCCTTG CTCTCGGAAA AATTAGAGGG CAAAGTTGCC
CCCCCCTCAG CTCGTTTGGA AGCCTTGAAT ACCCTCAAAC GTGCAGGCAT TCCAACTGGC
GTGGCGATGT TGCCAGTTAT TCCGTATCTG ACCGACACTG ATCGCCAATT GCCTTTGACC
TTGAATGCGA TCGCCAATGT TCAGCCCGAT TTTGTGGTTT GGGAATATCT ATGGCAGCCG
AATGAACGCC ATCGCCAACG AATTACCGAT TTGCTTTCGC GCTTGGGCAA TTATCCCGCC
TCATATTATC GTGAATTGTA TGGCAAGGAT ATGCAGCCGA GCCTCGAATA TCGCCGTGAG
ATGCATCGCG ATATTTTGGG GCGTTTTGAA GAGCTGAATC TTAACCCGCG AGCACCACTG
GAGTTGTATC GCGAGCATTT GGCTCCCAAT AATGTGGCGG CATTGATGCT CAAACATCAA
GCCTTTATCG ACCAAATCAA GGGTCGCGAA CTATTGGCCA GCCGCCACTC GAATTTGGCC
GAAGCGGTGT TCAATGGCAA AGCCGATGAG CCAGCCTTGG CGGTTAGCCC ATTGTGGCCG
ATGTTGCGCG AAGTGCTGAA TATTAGTGAT ACCCGCGCCC GACTCGACCA AATTCTTGAA
AAAGTGCGTA ACCCTGATAC CCCTAACGAT CCCAGTAGCG AGTGA
 
Protein sequence
MAYYVEERPG GPALAPRRPT INEFFLSTYQ VGPYVGCEFG CAYCDGWSFS QRPFNEVIRA 
NVDLPDRFAE QLSVVSRGDL IAFSLGDAYQ PAEKTYRLTR QMLQACQVAK QPVLILTKSL
AVMDDLSLLQ RMNEQGLAIV VMSIPTIDPL LSEKLEGKVA PPSARLEALN TLKRAGIPTG
VAMLPVIPYL TDTDRQLPLT LNAIANVQPD FVVWEYLWQP NERHRQRITD LLSRLGNYPA
SYYRELYGKD MQPSLEYRRE MHRDILGRFE ELNLNPRAPL ELYREHLAPN NVAALMLKHQ
AFIDQIKGRE LLASRHSNLA EAVFNGKADE PALAVSPLWP MLREVLNISD TRARLDQILE
KVRNPDTPND PSSE