Gene Haur_4013 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4013 
Symbol 
ID5735874 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp5119846 
End bp5121582 
Gene Length1737 bp 
Protein Length578 aa 
Translation table11 
GC content54% 
IMG OID641281163 
Productphosphotransferase domain-containing protein 
Protein accessionYP_001546773 
Protein GI159900526 
COG category[L] Replication, recombination and repair 
COG ID[COG1796] DNA polymerase IV (family X) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTTAGCA ACCAAGCAAT TGCCCAAGTG TTTGCCGATA TTGCCGATGC CTTGGAAGTG 
ATCGGCGAAA ATCGTTTTCG ATTGGCGGCC TATCGTCGTG CCAGCGATAC CCTCGCCGCC
CAAACGACCA GCCTTGCCAG CCTGCGTGAG CAAGGGCTAT TAACCAGTTT GCCCAATATT
GGCGAGGCCA GCGCGGCAAT TATTGGCGAA TTGCTTGATC ATGGCCATTC GGCCTTGGCC
GACCAGCTTT TGGAAAAAGT TCCGCCTGGG TTGTTGCAGA TTTTGCGTGT GCCCGAAATC
GGCCCCAAAA CCGCAGCTCG GCTTTTTCGC GATGAAGGCA TTCAGGATCT TGAGGCCTTG
TTTGCGGCGG CTCAGGATGG GCGTTTGGCC AAAATTAAAG GCTTTGGCGC GAAAAGTGCA
GCCAAAGTGT TGAGCGCTCT CGAAGCCATG CAAAATCAGG TGCTGCGCTT GCGGTTGGTC
GATGCCTTGC CCGTGGCCCA AGCATTAAGC GCCACGTTAG CTGAAATTGC CAGCATTAGC
GCGGTGCAAG TGGTTGGTTC AACCCGTCGT TATCAAGCCA GTGTTGGCGA TTTGAATTTT
GTGGTAGCTA GCTTAGATCA AGCTGCCGCC TATGCCGCGA TCGCTGAATT ACCTCAAGTT
GCCCAAGCTA CCCCCAGCGA CAACGGCTTG CGCTTGTTGT TGCACAACGG CATGAATGCT
TGGATTGTAG CGGTTGCGCC CGAACATTGG GGCAGCGCTT TGGTCTATTG GACTGGCTCG
ACTAGCCATG TAGCTGGCTT AAATCAATTG GCTCAGGCCC AGAATTGGCA GCTTGATCCG
CTTAATTTTC CTGATTTTGC CGATGAAGCA GCGGTGTATG CAGCTTTAGG CTTGGAATGG
ATCGCGCCTG AATTGCGCGA AGGCTGGGGC GAAATTGCGC TAGCTCAAGC CCAGCAATTG
CCCAAATTGC TCGAACAATC GGCGATTATC AGCGATGTAC ATTGGCACAC GACGTGGAGT
GATGGCAGTG CCAGCCTACG CGAGATGGCG CTAGCCGGCA TTGCCAAGGG TTATCGCTAT
ATGGCGGTGG CTGATCATAG TGCCTATTTA GGCGTAACGG GTGGGCTAGA TGGCGAACGT
TTGCTGGCTC AACGCGCCGA AATCAACGCA CTCAATCAGC AATTAGCCGC CGAAGGCCAT
GAATTTCGCT TATTGCAAAG CTGCGAAGTC GATATTTTGC CCGATGGCAG CTTGGCCTTG
CCCGATGAGG TGCTAGCCAC ATTAGATTTA GTGGTGGCTT CGCCGCATGT GGCCTTACGC
CAGCCCCGCG CTGAATCGAC CGCCCGCATG CTGCGAGCGA TTAACCATCC CTTGGTGACG
ATCATCGGAC ACCCAACTGG GCGGATTTTG AATGGCCGTG CTGGAGCCGA TTATGATATG
GCGCAGATTA TTGCGGCAGC AGCGGCAACG GGCACAGTTT TGGAAGTTAA TGCTGGGCCT
GAACGGCTTG ATTTGGATGC GCCAAATGTG CGGGCAGCGC TGGCGGCTGG GTGTAATATC
AGCATCAACA CCGATGCCCA TGCCACGGCA GGTTTCGATA ATCTGTTTTA TGGAGTTGTA
ACGGCTCGCC GTGGTGGCGC TAGCAACCAG CAAGTAATCA ACACATGGGA TGCTGAGGCA
GTATTGGCGC TGCGTCAGCA AAAATTACAA AAACTTGGGC TTAGTAGCGA ATCGTAG
 
Protein sequence
MLSNQAIAQV FADIADALEV IGENRFRLAA YRRASDTLAA QTTSLASLRE QGLLTSLPNI 
GEASAAIIGE LLDHGHSALA DQLLEKVPPG LLQILRVPEI GPKTAARLFR DEGIQDLEAL
FAAAQDGRLA KIKGFGAKSA AKVLSALEAM QNQVLRLRLV DALPVAQALS ATLAEIASIS
AVQVVGSTRR YQASVGDLNF VVASLDQAAA YAAIAELPQV AQATPSDNGL RLLLHNGMNA
WIVAVAPEHW GSALVYWTGS TSHVAGLNQL AQAQNWQLDP LNFPDFADEA AVYAALGLEW
IAPELREGWG EIALAQAQQL PKLLEQSAII SDVHWHTTWS DGSASLREMA LAGIAKGYRY
MAVADHSAYL GVTGGLDGER LLAQRAEINA LNQQLAAEGH EFRLLQSCEV DILPDGSLAL
PDEVLATLDL VVASPHVALR QPRAESTARM LRAINHPLVT IIGHPTGRIL NGRAGADYDM
AQIIAAAAAT GTVLEVNAGP ERLDLDAPNV RAALAAGCNI SINTDAHATA GFDNLFYGVV
TARRGGASNQ QVINTWDAEA VLALRQQKLQ KLGLSSES