Gene Haur_5267 details

Gene Information

Locus tag: Haur_5267
Symbol:
ID: 5737225
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009974
Strand:
Start bp: 42419
End bp: 44341
Gene Length: 1923 bp
Protein Length: 640 aa
Translation table: 11
GC content: 57%
IMG OID: 641282431
Product: TPR repeat-containing protein
Protein accession: YP_001548022
Protein GI: 159901777
COG category:
COG ID:
TIGRFAM ID:
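
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 42419
end_bp = 44341
gene_length_bp = 1923
protein_length_aa = 640


def span_length(start, end):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by a complete CDS, excluding the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks pass for the values listed in this record.
assert span_length(start_bp, end_bp) == gene_length_bp
assert expected_protein_length(gene_length_bp) == protein_length_aa
```

Under these assumptions, 44341 - 42419 + 1 gives 1923 bp, and 1923 / 3 gives 641 codons, one of which is the stop codon, leaving 640 residues.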


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.433566
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGATGTCC CCGCGCTCAT ACCCATCGTG GCCGATGCCA TTCTTGCCCA CTGCCCCCAG 
TATGATCGTG CGAGTGTTAT GATGGCCCTC GAATCGGTCT TTGCTGGCCA TCCTGCGTTG
CTTGGTGACA AGACGATCTC GATGCTATTT GGCCAGAACA ATGATTTTAC CAATGCCACG
GTGACGATAG GCGACGTGCA TGCAGGCTAC ACGATGAACG TCCACCTCCC GCAGCCGCTT
GATCCGACTC TCGCCGCGCT CATTGCCCGT GCCTCGTTGC CTGCCACCAT CCACCATGGG
GATATGACGA TCAACGAGAG TGGTGTTTCA GGGGTTGTTG TCCGCGATAA TTATGGGTCA
ATTCATCAGA CGATTCACCA GCCCCAGCCG ATTGATCCGC TGCCCGCTGC CCTCGCTGCC
CTTGCCTCAA TGCCGCTTGG TTATGTACCA GAACCCCGTG CGGATTTGCC CCACCCCTCA
CGCCTGCCCT TTGAAGCCAG CCCGCACTTT GTTGGACGCG AAGAAGCATT GAAACAGTTG
GCACAGGCGA TTGGTTCAGC CCAACCAGCG GTCGTCATGC CAGCGGTGGC CACAGGGCTT
GGTGGTATTG GCAAAACCAG CCTCGTCACC GAATTTGCCT ATCGCTATGG AGTCTACTTT
TATGGCGGGG TGTTTTGGCT GAATTGTGCC GACCCCGATC AAGTGGCCAG TCAAATAGCC
GCGTGTGCGG TTGGCTTGAA ACTTGATACT ACTGGGATGG CGCTCGATGA GCAGGTGCAA
CGGGTTCTCA ACGTTTGGAA ATCGCCCATG CCACGCTTGC TGATTTTCGA TAACTGTGAA
GATCGCGTAC TGCTCAAACG GTGGAAGCCC ACGGTCGGCG GCTGTCGGGT GCTGGTGACT
TCTCGATCCA ATCAGTGGCC AACCCTAACC CAGATCCAGC TTGGGGTGCT CTCACCCGTA
GAAAGTCGAG CACTCTTGCA GTCGCTCTAT AGTGATCTCA CGGATGCTGA CGCTGATCTG
ATTGCGGAGG ATCTGGGACA TTTACCACTC GCGTTACATC TGGCAGGAAG CTATCTTCAT
ACCTATCGGC CTACGATTAC GGCGTATCGT GCTGCCCTCA GTATTGCCCA TGAATCGTTG
CATGGGTATG GAGCATTTGA CTCGCCAACC GAACATGAAC AGAATGTCGA CGCGACGTTT
ATGTTGAGTT TTAACCAGCT TGATCCGGAC AACACGATTG ATGCCTTGGC ATTGGGCATG
CTTGATGGGG CGGCGTGGTG TGCGCCGGGT GTACCAATTC CGCGCGAGTT GGTGCTGACA
TTCATGCCGG ATGAGACAAC CCATACCGCC AGGGTTGATG CGTTGCGGCG CTTACAGCAG
CTTGGATTGC TCGATGGCAC GGATGCCGTG GTGTTACACC GATTGCTCGC TCAGGTTGTG
CAGGCACATT TAGGATCGTC CGAAACGCTG GCCGTGGTGG AAGACCGGAT TGGTGCTGCA
GCATCCCGTG CGAATGGGAC GGGTGTGCCA CGTTCGATGC TGCCGCTTGC GCCGCATCTG
CGGTATGCCA CCATGCGGGC GTTGGATCGT GGTGATGCCC AGGCAGCCCG CCTTGCCAAT
AGTCTTGGTT TTTATGAAGA TCTGCGAGGA GCCTATGCTG CTGCAGAGGA GTTGCATGAA
CGGGCCTTGG CGGTGCGGGA AGCGGTGTTG GGGGCGGAGC ATCCCGATAC GGCGACGAGT
GTGAACAATC TGGCGGTGGT CTTGAAGCGG CAAGGGCGGT ATGCTGAGGC GCAGCGCTTG
TTTGAACGGG CCTTGGCGAT CAGGGAAGCG GTGTTGGGGG CGGAGCATCC CGCTACGGCG
ACGAGTGTGA ACAATCTGGC GGTGGTCTTG GAGGGTGGTT TGTCAAGAAG TGTGCAAAAT
TAA
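
The sequence above is printed in space-separated blocks of ten bases, so it has to be cleaned before its length or GC content can be recomputed. The following is a minimal sketch of that step; `clean_sequence` and `gc_percent` are hypothetical helper names, and only the first line of the gene sequence is used here as sample input.

```python
def clean_sequence(raw):
    """Remove spaces and line breaks from a block-formatted sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, copied from this record.
first_line = "ATGGATGTCC CCGCGCTCAT ACCCATCGTG GCCGATGCCA TTCTTGCCCA CTGCCCCCAG"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the full gene sequence through the same two functions should allow the Gene Length and GC content fields to be compared against the sequence itself.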
 
Protein sequence
MDVPALIPIV ADAILAHCPQ YDRASVMMAL ESVFAGHPAL LGDKTISMLF GQNNDFTNAT 
VTIGDVHAGY TMNVHLPQPL DPTLAALIAR ASLPATIHHG DMTINESGVS GVVVRDNYGS
IHQTIHQPQP IDPLPAALAA LASMPLGYVP EPRADLPHPS RLPFEASPHF VGREEALKQL
AQAIGSAQPA VVMPAVATGL GGIGKTSLVT EFAYRYGVYF YGGVFWLNCA DPDQVASQIA
ACAVGLKLDT TGMALDEQVQ RVLNVWKSPM PRLLIFDNCE DRVLLKRWKP TVGGCRVLVT
SRSNQWPTLT QIQLGVLSPV ESRALLQSLY SDLTDADADL IAEDLGHLPL ALHLAGSYLH
TYRPTITAYR AALSIAHESL HGYGAFDSPT EHEQNVDATF MLSFNQLDPD NTIDALALGM
LDGAAWCAPG VPIPRELVLT FMPDETTHTA RVDALRRLQQ LGLLDGTDAV VLHRLLAQVV
QAHLGSSETL AVVEDRIGAA ASRANGTGVP RSMLPLAPHL RYATMRALDR GDAQAARLAN
SLGFYEDLRG AYAAAEELHE RALAVREAVL GAEHPDTATS VNNLAVVLKR QGRYAEAQRL
FERALAIREA VLGAEHPATA TSVNNLAVVL EGGLSRSVQN
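
The relationship between the gene sequence and the protein sequence can be sketched as a codon-by-codon translation. This example assumes that translation table 11 assigns the same amino acid to each codon as the standard genetic code (the two differ in which codons may act as start codons, which does not matter here because the gene begins with ATG). The `translate` function is an illustrative helper, not the tool used to produce this record.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a cleaned, upper-case CDS, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, with the block spacing removed.
prefix = "ATGGATGTCCCCGCGCTCATACCCATCGTG"
print(translate(prefix))  # MDVPALIPIV, the first ten residues of the protein sequence
```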