Gene Haur_0255 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_0255 
Symbol 
ID5732150 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp298387 
End bp300078 
Gene Length1692 bp 
Protein Length563 aa 
Translation table11 
GC content55% 
IMG OID641277379 
Producthypothetical protein 
Protein accessionYP_001543035 
Protein GI159896788 
COG category[L] Replication, recombination and repair 
COG ID[COG0419] ATPase involved in DNA repair 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCACAAC CAACTGAGAT TGTTCACCTG CTACGCCCCT TCGGGCGGCG CTTACGCCTT 
GCCGATACAT TGCAATGGCT GAGTCGTACA TTTTGGCTAC CGGCGCTTGG CTTTGCCTTA
ATCCAAGGTG CTGGTCGGCT CTTCCCAATT CCTAACTGGA CATTGTGGTC GTTAGTGCCC
TTCGCGGTTT GGCTCGTTGC CTTACTGGTC GTCGCGTTGC GCCCCCAATC GAGCAACCGC
ATCGCCCAAC GCAGCGATCT CGAGCTCGAT TTGCGCGAAC GGCTCTCGAC TGCACTCGAA
TTGCACAAAC GCGATGATCG CGGGCCACTC GACGATGTTC AATATAATGA TGCGCTTGAG
AAAGCCCGCA ACGCCAATGC CAAGGATATT CGGGTTGCAC CACCACGCAA ACATCGCTTG
TGGTGGGGTT TGGCAACAAT GTTCCTTGTG CTGGGCATTA CTTCGGCGGT ATTGCCGAAT
GCCCAAAGCA ATGTTTTGGC TGAACGCGAA GCGGTCAAAA CCGATCTCGA ACGAATCGCC
GACCGTATCG ACCAAACCCG CGAAGAAATT GCCAAAGACG AAACGCTTTC ACCCGAAGAA
CGCGTCGAAT TGGATCGCCA ATTGGCTGAG TTATCCAAAG ATTTGCGCGA AAATACAGGC
TCACGCGAAG ATGCTTTGGC GAAAATCTCG CGCACCGAGC AACAATTGCA AAAGCGCTTG
GATTCACGCG CTAGCGCTCG TCAGGCTTCG TTGCAAGAAT TGGCCCAAGC TTTGAAAAAC
CAACGTGGAG CCAGTCAAGA GCAACGCCCT GAAGCCAGCG ATGCCGCCAA AGAGTTGGAA
CAAGCAGCCC AAGATGCTGA AAAGATGACT CCTGAGCAGC GTGAGCAATT GGCGAATGCG
CTTGAGCAAC AAGCCAACCA AACTGCGGCA ACGAATCCTG AATTGGCTCA ATCGTTGCGC
GATGCTGCTA GCGCTTTACG CAACGGCAAT CTTGCCGATG CTCGCTCAGC CTTGCAACGA
GCCAGCAACC AAGCCAGCCA AAGCGCTGAG CAATTGCAAG ATCAACAAGG CGTTGAGCAG
GCTTTGAGCG AAGTGCAAAA TAGCCGTGAC CAAGCGGCCC AAGCTGGTCA ACAAGGCCAA
CAAAATCAAC AAGCTGGTCA ACAAGGTCAG CAGGGTCAGC AGGGTCAGCA GGGTCAGCAG
GGTCAGCAGG GTCAGCAAGG CCAGCAAGGT CAGCAAGGTC AGCAAGGCCA AGGTCAAGGT
CAGGGTCAAG GCCAAGGTCA AGGCCAAGGT CAGGGCCAGG GTCAAGGTCA GGGCAATGGT
GCTGGCGGCG GCGGTGGCAG CCAATCCAAC AATCTTGGCT CAGGCAACAG CAATGGCAGT
GGCAATGGCA CAGGCCGCAA CGATCGCGAC TTCCCTGGCA CTGGCAACGA TAAAGGCCTT
GTCTACCAAC CATGGAAACC AGGCGCTGCC AATGACCCAA GTAGTGTGAG TGGCCAGCCC
AATGGCAGCG GTAACGCGCC CAGCCAACCG AGCGGCAGCA CTGGCCCAGG CATCGCCAAT
GGCTCACAAG TGCCCTATAA TCAAGCTGGC TCGGACTACC AAGAATCGGC AGGGCGGGCG
GTGGATAGCG GATACATCCC ACCACAATTG AAAAACTTTA TCCGCGACTA CTTTAACCAA
TTGGAACAGT AA
 
Protein sequence
MAQPTEIVHL LRPFGRRLRL ADTLQWLSRT FWLPALGFAL IQGAGRLFPI PNWTLWSLVP 
FAVWLVALLV VALRPQSSNR IAQRSDLELD LRERLSTALE LHKRDDRGPL DDVQYNDALE
KARNANAKDI RVAPPRKHRL WWGLATMFLV LGITSAVLPN AQSNVLAERE AVKTDLERIA
DRIDQTREEI AKDETLSPEE RVELDRQLAE LSKDLRENTG SREDALAKIS RTEQQLQKRL
DSRASARQAS LQELAQALKN QRGASQEQRP EASDAAKELE QAAQDAEKMT PEQREQLANA
LEQQANQTAA TNPELAQSLR DAASALRNGN LADARSALQR ASNQASQSAE QLQDQQGVEQ
ALSEVQNSRD QAAQAGQQGQ QNQQAGQQGQ QGQQGQQGQQ GQQGQQGQQG QQGQQGQGQG
QGQGQGQGQG QGQGQGQGNG AGGGGGSQSN NLGSGNSNGS GNGTGRNDRD FPGTGNDKGL
VYQPWKPGAA NDPSSVSGQP NGSGNAPSQP SGSTGPGIAN GSQVPYNQAG SDYQESAGRA
VDSGYIPPQL KNFIRDYFNQ LEQ