Gene Haur_2257 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_2257 
Symbol 
ID5734144 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp2885853 
End bp2888339 
Gene Length2487 bp 
Protein Length828 aa 
Translation table11 
GC content47% 
IMG OID641279398 
Producttetratricopeptide TPR_4 
Protein accessionYP_001545025 
Protein GI159898778 
COG category[R] General function prediction only 
COG ID[COG0457] FOG: TPR repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGACCC TGCCTAATGC TGAAACGCTG CAACAAGCCG AAGCCTTGCT CGCTCAAATT 
CCACTTGATC GCATTCCAGC CCATCAACGC CCACGCGCTG ATTGGATGCT TGACGATCAG
TTTCCGATCA GCAGTTTTGT TGGGCGCGAA GATTTGTTAA AGCAATTAGC GGCGGCCATG
GCCTCAACCA CCCCAACTAT GATTGTGCCA ACGCTGGCGA TTACTGGCAT GGGCGGCATT
GGCAAAACCA GCCTTGCGCT CGAATTTGCC TATCGCTATG GCCATTATTT TGCTGGTGGC
GTATATTGGA TCAACGCCGA CTATACGCCA ATCGCCACCA CGGCTGCAAC GATCTTGCCC
TCGGTTGATC GATTGTGGCA AAAGTTATTT CCTCAGCGTG ATAGCAGCCA AATCAGCCCT
GAACAGCGGC TTAACGAGAT TAAAAGCTTT TTCAATAGCC CAATTCCGCG CCTGTTGATT
TTTGATAATT GCGAGCAACA ATGGATTTTT GAAAGCTATC GGCCAGGCCC GCAAAGCGGC
TGTCGTGTGC TGATGACTAG TCGCAACGCG GTTTGGTCAT CGAGCAATGT TCGTGCAATT
GCGATTGATT TGCTGACCCC AGCTGAAAGT CGCCAGATGC TGCAAAAACT TGCTCCACGA
GTTACCGATG CTGAGGCCGA TGATTTAGCC AAACTGGTGG GTTATTTGCC TTTAGCATTG
CATGTAATGG GCGTGGCCTT GGGAACACTT GAACCATCAT TACCTGTCGC CAATTATTAT
CAACGAGTGC AGCAAGCCTT AGTAGCCGAA CTCGAAACCA GCGCCAACAC CTTGCAAAAC
CTCCATCGTT CGCCAACTAA CCATCAATGG AGCGTCGTTG CTACGGTGCG CGTGAGCTAT
GGCCTGCTCA AACGCAGCTA TCAGGATGAA GCCAAACTAC GCCATCTGTT ATTGTTGCTG
GCATGTTGTG CGCCAAATGC GCCAATTCCA ATCGATCTGT TGGTACGAGC AACCGAGCAG
GATTCGGCGA CCGTTGGCGG ATGGTTGTAC GTACTACGCC AAAGCGGCTT TTTCGATCAC
GACCCACCGC AGTTGCACCC GCTGATGCGC GAGGCGATTC GCATTATTGA AGCCGAGCAC
TACCCCAACG CCGCCAACAT AATGACCGCT GCTTTGGTGG CTGAGGGCAA GGATGCTCAT
GAAAAATGGC AGCGTGAGGC GATGTTGGCT CTAGTGCCTC ACCTAAGCGC TTGCCATGAA
ACCGAAAAAA CGAAGCAAGG TTATACGGGC AATGTACTAG CAATTATCGC TCAAATTAAC
CAACGACAAG GTAACTACCG CCAAGCAGAG CAATCTATGC GTGAAGTGTT AGAGTATGAA
ATTGCCGTGT ATGGTTACGA GAAACAAGAA GTGATTACAA CTCAGCATAA TCTTGCTAAT
ATATTATACG ATCAAGGTCT ATATATAGAA GCATTGAACC TCTTCCAAGA AATACTAACT
ATCGAACAAC AAATATTAGA CGCAGAACAT CCCCATATTC TAGCTACTAA ACATGAACTC
GCAAGGGTTT TACAGGCCCA AGGTGAATAT GCACAAGCCT TGGAGTTATA TCAAACCGTC
CTTGCTAGTA ACCAACGAGT TTTAGGCACT GATCATCCTT CAACTCTCGC TTCTCAGCAT
AATATTGCAA GTGTGTTTCT TGCCCAAGGA GATTACATCC AAGCACAGGA ACTCTACCAA
ACAGTCTTTA CCATTCAACA ACGAGTTTTG GGCGAAAATC ATCCTTCTAC CCTTGCCGCT
CAACATGAGC TTGCGAGGGT GTTAGTCGCT CAAGGCAACT ATGTAAAAGC ACAGGATATT
TTCAAGGCAG TCCTTGTCAT TAATCAACGA AACTTAGGGA CGGATCATCC TCATACACTC
ACCACCCAAC ATGAACTTGC GAGGGTATTC CTCATGCTAG GTGACTATGA TCAATCCTTG
GATCTCTTTC AAACAGTTCT CGTTATAAAT CAAAGGGTTT TAGGGGCAGA GCATCCTTTG
ACCCTCTCCA CTCAACATCA TCTAGCAAGC ATATTTCTCG CTCAAGGCAA CTATGTAAAA
GCACAGGAAG TTTTTCAGGC AGTTCTTCCT ACCAAACAAC AGGTTTTAGG CGCGGAGCAT
CCCGATACCC TCGCTACCCA GCATAATATA GCAAGTATAT TTTATAGCCA AGAAGCCTAC
GACCAAGCCC TAGATATTTC CCAAACAGTC CTCAACATTG AAAAACAAAC TTTAGGAGAT
GATCATCCTG ATACTCTCAT AACTCAATCC AATATTGCTG TATGCATGGC CCGACAGGGC
CAATATTACG AGGCTGTAGC GTTGTTCTAT GAGATTATCC CCAAGCAAAT TCGCTGTTAT
GGAGGAACTA CTCATCCCAA GGTGCAGGCG AGCATTGAAA TTCGTGATGC AATTGTGGCG
GCTTTTTGGC AAAAGGAGCA AGGCTAG
 
Protein sequence
MTTLPNAETL QQAEALLAQI PLDRIPAHQR PRADWMLDDQ FPISSFVGRE DLLKQLAAAM 
ASTTPTMIVP TLAITGMGGI GKTSLALEFA YRYGHYFAGG VYWINADYTP IATTAATILP
SVDRLWQKLF PQRDSSQISP EQRLNEIKSF FNSPIPRLLI FDNCEQQWIF ESYRPGPQSG
CRVLMTSRNA VWSSSNVRAI AIDLLTPAES RQMLQKLAPR VTDAEADDLA KLVGYLPLAL
HVMGVALGTL EPSLPVANYY QRVQQALVAE LETSANTLQN LHRSPTNHQW SVVATVRVSY
GLLKRSYQDE AKLRHLLLLL ACCAPNAPIP IDLLVRATEQ DSATVGGWLY VLRQSGFFDH
DPPQLHPLMR EAIRIIEAEH YPNAANIMTA ALVAEGKDAH EKWQREAMLA LVPHLSACHE
TEKTKQGYTG NVLAIIAQIN QRQGNYRQAE QSMREVLEYE IAVYGYEKQE VITTQHNLAN
ILYDQGLYIE ALNLFQEILT IEQQILDAEH PHILATKHEL ARVLQAQGEY AQALELYQTV
LASNQRVLGT DHPSTLASQH NIASVFLAQG DYIQAQELYQ TVFTIQQRVL GENHPSTLAA
QHELARVLVA QGNYVKAQDI FKAVLVINQR NLGTDHPHTL TTQHELARVF LMLGDYDQSL
DLFQTVLVIN QRVLGAEHPL TLSTQHHLAS IFLAQGNYVK AQEVFQAVLP TKQQVLGAEH
PDTLATQHNI ASIFYSQEAY DQALDISQTV LNIEKQTLGD DHPDTLITQS NIAVCMARQG
QYYEAVALFY EIIPKQIRCY GGTTHPKVQA SIEIRDAIVA AFWQKEQG