Gene Haur_1597 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_1597 
Symbol 
ID5733484 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp1850567 
End bp1852960 
Gene Length2394 bp 
Protein Length797 aa 
Translation table11 
GC content47% 
IMG OID641278736 
Producttetratricopeptide TPR_4 
Protein accessionYP_001544368 
Protein GI159898121 
COG category[R] General function prediction only 
COG ID[COG3903] Predicted ATPase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00177125 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCATTAT TACCATCACA ACTCCCCCAA TATGCCACGC GCCTGATTGG TCGCACCCGT 
GCCCGCACGT TGGTAATCGA CCTGTTGCTT GATGCTCAAG CGCGGTTGGT AACATTATAT
GGCCAAAGTG GTGCTGGCAA AACGCGCCTA AGTTTGGAAG TTGCCGAGCA GGTGGGCGAA
ATTTTTCGTG ATGGCCGCTA TTTTGTAGCG CTCGCTCCTG TTTCGCAAGC CCAGTTTGTG
CTGCCAACAA TTGCCGCAAC GTTAGGCGTT GAAGAATCTC AGCACGAAGC AATTTTAGAT
TCGTTAATAT TGGCCTTGGC CGATAAGCAA ATTTTATTAA TTCTTGATAA TTTTGAGCAA
GTTGCTGGAG CCGCCAGCGA ACTATTGGAG TTAATCCGAC GTGCACCGAA CCTAACATGT
TTAATCACGA GTCGTCAAGC GCTCGAAGTT GCTGGCGAAA CCGCGATTAT GGTTCCAGCC
TTGCAATATC CTGAGCTTGG TGAAGACTAT CAGCTTGAAG ATTTAGAGCA ACATAGCGCA
ATTGGCTTAT TTGTTGATCG CATGCGCACA CGTCAGCCAC GGTTTCGCTT GAGCGCTGAT
AATGCTGGAG CTTTGGTCGA TATTTGTCGT TTGGTGCAGG GCTTGCCCTT GGCGATTGAG
TTAATCGCGG CCCATAGTGC TTCGTTGACC CCCCAAGATT TGCTGTTTTT CGTGCGTAAT
CATCTTTCCA TGGCGGCTTT GAATCCGAAA CAATCGGCGC GGCAAGCGAT TATCAAGCCA
GTGCTGGCTT GGAGTGTGAG CATGCTGCCA GCTGATGCCA AAGATATTTT TGCTCAATTA
GGTGTTTTTG CTGGTGGTGC AACCGTCGAA ACAATTAAAC AGGTTGGTTT GGTTGAAACC
ATGCCCTTTG AATCCAGCCT AAATGCCTTG ATTGATCGCC ATTTATTGCA AACTGAACAA
TTGCCAGGCC AAAAAGCCCG TTTTATTATG ATTGATGCGG TGCATGAATA TGCTTTAGAG
CAATTGCAAA AAACGGGCCG CCTATATTAT TTGCAAGAAC GTCATGCCAT CTATTATCAA
ATATTGAGCG AAACAGTGCA TCAGAATATC CGGGGGGCTG ATGGTGCTAA ATGGATCGAA
CAATTGCGTG GTGAGATTCA TAATATTCGC CAAGCGATGA TTTGGTCGCT TGATAGTAGC
GATGGTTTGG TCGCTCAGCG AATTGCTGGC AATCTCTATT TTTATTGGTA TCGCACGAGT
GCTTATCGCG AGGCTGTGGC ATGGCTCGAA CAAACCTATC AACATTCCAA TCGCAGCGAT
TTAAGTGCAA TTGCCCGAAT TGCAACAGGA TTAGGTGGTT TATTAATTAG TTTGCTCCGT
TTTGCCGAAG CTGAACGCTA TTTAATTGAA GCTCGTCGTT TGTGGCAAGA GCTTGGCTTA
CCGCACGATG AAATTAGCGC AATTGGTAAT TTAGCAGTAT TGTATGGCAC GCTTGGCCGT
TTGCATGATT CGCAATTGGC GTTTGAAGCA GCCTTGGCTT TAGCCCGCAA GGTAGGTAAT
CAACAGCGCG AAATTTTGAT GCTGCATAAT CTTGGCACAG TAGCCCAAGA ACGGAATCAA
TTAGCGACCG CCCAAGCCTA TTTTGAGCAA GCTTTAGCGC TCAAACAACA GGTTAATCAA
ACCTGGGATA TGTTTCTAAC CCAAATTAAT CTCGGCTTAG TAGCGGTTGA TCAAGGGCGT
TATGCTGAAG CTGAGCAATG GTTTGAGCAA GCGTTTATCA ATGCCTATGC AATCGGTGAT
CACGATAGTT TGGCCTATAT TCGTTATGCA CGCGGGATAT CCGCAGCTGA GCAAGCTGAT
TATGTCCAAG CTGAGTTGCA TTTTCGCGAG TCAGAGCGAG GGTGGCATAC CGTTGGCAAT
CTGGAAGGAG TTCAGCGTAG TTGGCTTGAG CAAGCAGCAC TTTTAATTGC CACTGCCAAT
TATGCCCAAG CTGCTGAATA TTTGCATAAG GTTGAGCCAA TTGAGGCACT AAGCCAAGAA
TTACAATTAC GCCATATCAT TTTGGCAACT CGTTTAGCAA TCGCAATCGA TGATCAGGCT
GCGATGCAAC ACCAAGCTCA ACGAATGCTC GCAACTGCTT TGGCCAGCGA GCTACGCCGG
TTTGATCTGA CGATGTTGCA GCATAGTGCG GCAGTCCTAG TCGCAACCCA ACCAACACTG
GCGGCTCAAC TTTTAGCAAC CGCCGAGCAG CTTCGGGTTG AACGTAACTT GCACCAAAGT
GTTGCTGAGC AACAATGGCT GGCCCAAACC AATGTAGCTC GGCTTGTACC AACCATAGCT
TTGGATTTAA CCGCTGCTTT GCAAGCGGCT CAGGCTAGCT TAGCTGCGCA ATAA
 
Protein sequence
MPLLPSQLPQ YATRLIGRTR ARTLVIDLLL DAQARLVTLY GQSGAGKTRL SLEVAEQVGE 
IFRDGRYFVA LAPVSQAQFV LPTIAATLGV EESQHEAILD SLILALADKQ ILLILDNFEQ
VAGAASELLE LIRRAPNLTC LITSRQALEV AGETAIMVPA LQYPELGEDY QLEDLEQHSA
IGLFVDRMRT RQPRFRLSAD NAGALVDICR LVQGLPLAIE LIAAHSASLT PQDLLFFVRN
HLSMAALNPK QSARQAIIKP VLAWSVSMLP ADAKDIFAQL GVFAGGATVE TIKQVGLVET
MPFESSLNAL IDRHLLQTEQ LPGQKARFIM IDAVHEYALE QLQKTGRLYY LQERHAIYYQ
ILSETVHQNI RGADGAKWIE QLRGEIHNIR QAMIWSLDSS DGLVAQRIAG NLYFYWYRTS
AYREAVAWLE QTYQHSNRSD LSAIARIATG LGGLLISLLR FAEAERYLIE ARRLWQELGL
PHDEISAIGN LAVLYGTLGR LHDSQLAFEA ALALARKVGN QQREILMLHN LGTVAQERNQ
LATAQAYFEQ ALALKQQVNQ TWDMFLTQIN LGLVAVDQGR YAEAEQWFEQ AFINAYAIGD
HDSLAYIRYA RGISAAEQAD YVQAELHFRE SERGWHTVGN LEGVQRSWLE QAALLIATAN
YAQAAEYLHK VEPIEALSQE LQLRHIILAT RLAIAIDDQA AMQHQAQRML ATALASELRR
FDLTMLQHSA AVLVATQPTL AAQLLATAEQ LRVERNLHQS VAEQQWLAQT NVARLVPTIA
LDLTAALQAA QASLAAQ