Gene Haur_5289 details

Gene Information

Locus tag: Haur_5289
Symbol:
ID: 5737247
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009974
Strand:
Start bp: 77853
End bp: 80426
Gene Length: 2574 bp
Protein Length: 857 aa
Translation table: 11
GC content: 37%
IMG OID: 641282453
Product: TPR repeat-containing protein
Protein accession: YP_001548044
Protein GI: 159901799
COG category: [R] General function prediction only
COG ID: [COG0457] FOG: TPR repeat
TIGRFAM ID:
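
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are made up for this example.

```python
# Values copied from the Gene Information record.
start_bp = 77853
end_bp = 80426
gene_length_bp = 2574
protein_length_aa = 857

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codons = span_bp // 3
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                 # True
print(span_bp % 3 == 0)                          # True
print(expected_protein_aa == protein_length_aa)  # True
```

Under these assumptions the three fields agree: 2574 bp is 858 codons, which is 857 residues plus one stop codon.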


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.752131
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGATATTC GAACCCTTAT CCCACCCCTT GTTGATGCTG TGTTAGCAGT GTGTCCTGTT 
TATGAGCGTG CCACTATAGC CACGGCACTT GAACGAGTTC TTGTTGGTGA ACACATCACG
CTTGGCAATA ACACAATCTC AATGTTGTTT GGTCAAAATA ATGATTTTAG CAATGCAAAG
ATTATGATTG GTTCGATCCA AGCTGGACAT ACTATTTCTA TTAATGTTCA ACCTATCATA
GATAAATCTT TTTCTGATTC TACTCATACT ACCGATGATG ATTTAAAGCA AGCAAGGTTA
TTGCTATCTC ATATACCATT GGCCTCGCTA CCAAAGAAAG GGTTATTGCC TAAAGGTTCA
CGGATTCCCT TTAAAGATAA CCCTTTCTTT GTTGGGCGAG ACGACATGTT ACTTTCTATT
GCTAGTACTT TTTTTTCTTG TCACTCTGAT GCACCTATTC CGACTATTGG ATTAGTGGGA
ATGGGAGGGA TTGGAAAAAC GCAGTTGGCT GTGGAATTTG TCTATCGCTA CGGTTCATAC
TTTGCTGGCG GGATCTTTTG GCTTTCCTTT GCACAGCCAG ATTCGATTAA TACAGAGGTG
ATTGATTGTT ATAAATATTA CTGTCCACAA GTTATCGAAG ATTCGGCTGA AAAACAGATT
GCCTATATGA AATCCCTTTG GATGAATCCT TTACCTCGAT TACTAGTTTT TGATGACTGT
AACGAGGTTG ATTTGCTAGA GAAATGGCGG CCCCAATCAG GGGGATGTTA CGTGTTGGTC
ACGAGCCGTA GGCAGCAATG GCCTGCAACT GTAGAACTTT CCCTCCTGTC TGTATCAACA
CTTGATCTTG CAGGCAGCCT TGACTTACTT TGCCTTTATC GCCCTGATAT AAGAGAAGAC
CAAGCCTTAG GACAAAAGAT TGCTCAAAAA CTTGCTAATC TGCCATTAGC TATTCATATG
GCTGGAAGTT ATCTAGCACA CTATAAACTT AAGCTAGAAG TATATTTAGC CCAATTGGAT
CAAGGAATTA CCCACGAATC AATGAAGGGG AGAGGAACGT TCCATCAACC TACTAATCAT
GAAAGTGTTA ATGTTACGTT TAATATGGCG TTAAATAACC TTTCCAACCA TGAACCAGTG
AATATAATTT CACGTTTGCT CTTAGCTAGG ACAAGTTGGT TAATGTGTAA TGAGCCAATT
CCTAAAACAC TATTACAATC GTTTGCAGTA GATAAAGGGT ATGATGATTT AGATATTATA
GACTCTATAC ATAAAATTGT TAACTTAGGC CTCTTAGAAA TAACTATCCA TACAGGATTT
CGAATTCATG AACTGATAGC AGGATTAATA AAGGATAAAA TTAATGATGT TTCTGCATAT
TCAGATGTAG AGAGGATTTT AGGATCAAAA CTAGCTTCTC GGTCTACTTG GGAGGAAAGA
GAAGAATTAC AGTGGCTCAT TCCTCATGCA TATACAATTG TTCAATATGC ACTGAAGAGA
CAAGATACTA ATAGTGCAGA TTTTATGTAT GGTTTTGCAA GATCCCTCAA GAGAAATTCT
GATTATAAGG CATCTTTTAT GTATCATCAG AAAGCCCTAG CAATTCGTAA AAAAATCTTT
GGTGATAATC ATGTAAATAC AGCAAAAAGT TTGAATATGC TTGGGATGTT ATGTCGAAAA
ATGGCCAACT TTAATATGGC CAAGGAGTTT TATGAACAGG CATTAGAAAT TTATCAAAGA
GATTTGGGTG ATGATCATCC TACTACACTG AGTACTTTAA ATAATCTAGG CTATCTATTA
AAAGCGCAGG GCGATTTACT TCAGGCCCGG GAGTGTTATC AAAAAGTACT AAAGAGTCGT
CTTATAAATA GAGGGGAGGA ACACAGAACT ACAGGTACGA TTTTTAATAA TTTAGGCATG
GTTCTAAAGG ATCTGGGTGA TTTAAAATCC GCCCAAGAAC ATGTAAAGAA AGCAGTAGAA
ATTCGTAAAA GGATATGTGG TGATAACCAT CCCGATACTC TTATAGCTCT ATATAATTTA
GCTAGCATTC TTTTTGATTT AGGGTTGATT GAGGATTCCT ACAAAGTTGC AAAATCAGTC
CTTGACGATC GTTGCCGTAT TTTAGGGGAT GAACATTCTA GTACTGCAAG TAGTTTGTAT
CAGGTTGGAA AATTGCTCCA TACTAAAGGT GAGATTGATT CAGCGCTGTA TAATCTTGAG
AAAGCATTGA CGATTCAAAA GAAATTGCTT GGTATTGATA ATCCATATAC AGCTTTAACA
CTTCAAGAAT TGGGTAGGTT GTTTCAATCA AAAGGAGAGT TTGAATTGGC ACGACATAAT
ATTGAGTATG CGCTTGGTAT TCAACAAAGA ATATTCGGAT TAAATCATCC TGCTATCGGT
TTAAGTTTTC ATAATCTAGG TGAATTATAT GAGAAGATGG GGAATTTGCA GATTGCTCAC
TTCTACTATA AGCAAGCATT AGAGGTTAGA ATACATATTC TTGGAGAAAA TCACCCATCA
ACAATAGGCA CAATAGATTG CCTTAATCGT AGCAGTAATT GGAATAAAAT ATAG
 
Protein sequence
MDIRTLIPPL VDAVLAVCPV YERATIATAL ERVLVGEHIT LGNNTISMLF GQNNDFSNAK 
IMIGSIQAGH TISINVQPII DKSFSDSTHT TDDDLKQARL LLSHIPLASL PKKGLLPKGS
RIPFKDNPFF VGRDDMLLSI ASTFFSCHSD APIPTIGLVG MGGIGKTQLA VEFVYRYGSY
FAGGIFWLSF AQPDSINTEV IDCYKYYCPQ VIEDSAEKQI AYMKSLWMNP LPRLLVFDDC
NEVDLLEKWR PQSGGCYVLV TSRRQQWPAT VELSLLSVST LDLAGSLDLL CLYRPDIRED
QALGQKIAQK LANLPLAIHM AGSYLAHYKL KLEVYLAQLD QGITHESMKG RGTFHQPTNH
ESVNVTFNMA LNNLSNHEPV NIISRLLLAR TSWLMCNEPI PKTLLQSFAV DKGYDDLDII
DSIHKIVNLG LLEITIHTGF RIHELIAGLI KDKINDVSAY SDVERILGSK LASRSTWEER
EELQWLIPHA YTIVQYALKR QDTNSADFMY GFARSLKRNS DYKASFMYHQ KALAIRKKIF
GDNHVNTAKS LNMLGMLCRK MANFNMAKEF YEQALEIYQR DLGDDHPTTL STLNNLGYLL
KAQGDLLQAR ECYQKVLKSR LINRGEEHRT TGTIFNNLGM VLKDLGDLKS AQEHVKKAVE
IRKRICGDNH PDTLIALYNL ASILFDLGLI EDSYKVAKSV LDDRCRILGD EHSSTASSLY
QVGKLLHTKG EIDSALYNLE KALTIQKKLL GIDNPYTALT LQELGRLFQS KGEFELARHN
IEYALGIQQR IFGLNHPAIG LSFHNLGELY EKMGNLQIAH FYYKQALEVR IHILGENHPS
TIGTIDCLNR SSNWNKI
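
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It is a minimal example, not the tool that produced this record: the helper names (`join_blocks`, `gc_fraction`, `translate`) are hypothetical, and only the first 30 bases of the gene sequence are used. Translation table 11 assigns the same amino acids to codons as the standard code and differs in which codons may act as start codons; since this gene begins with ATG, a standard codon table is assumed to be sufficient here.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def join_blocks(text):
    """Remove the spaces and line breaks between the 10-base blocks."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Fraction of bases in seq that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    usable = len(seq) - len(seq) % 3  # ignore any trailing partial codon
    for i in range(0, usable, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGGATATTC GAACCCTTAT CCCACCCCTT"
dna = join_blocks(first_blocks)
print(translate(dna))  # MDIRTLIPPL
```

The ten residues produced match the first block of the protein sequence. Applying `gc_fraction` to the full gene sequence would give a value comparable to the GC content field in the record.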