Gene Haur_2373 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_2373 
Symbol 
ID5734254 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp3024060 
End bp3026816 
Gene Length2757 bp 
Protein Length918 aa 
Translation table11 
GC content50% 
IMG OID641279514 
Producttetratricopeptide TPR_3 
Protein accessionYP_001545141 
Protein GI159898894 
COG category[S] Function unknown 
COG ID[COG4995] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCATGCTC CTGAGCGAAT GCTTGAGCGG CAGCAACGCT ATCAGGTAGC TGACGACCAT 
GAGTTTGCAT GGCGACAATA TTGTCTTGGT TTGGCCAACC TTCGGCTTCA GCGGCTTGCG
GCGTTGCAGT TGGCCCTGAG CCAAGCCCGC CAAGCATTTG GGCTGAGCCA TGATCCGCTT
GGCCTTTTGT ATTGTGAGGG TTTGGCTTTA GCTGGGGCTT GGCTCGCTGG AACAAATGTC
GAGAATTTGC AGCGACTGGA AGGCTTGGCT CACCGCTTTC GTGCGCTTGG CTATCCACTT
GAGGCAACTC GGACGGATCT GCTGCGGGTT TTGCATTATT ATAGTCTTGG TCAGGTTGAG
CGAGGGTTGA GCTTGATTCA GCCGCTTGAA GCCTCGATTG ATCAGTTGGG CAGCAGATAC
GACCAAGCCT ATTGGCTATG GATTAGTGGG ATGACCTATA GTCATAACCA TGATTTTAAC
CAAGCTGGTC GTCTGCTTGA TCAGGCATTT GTGCTTTTTA GCAGCTTGCA CATGCAACCA
GAGTGTGCCC GTTGCTTGTT TGATCGTTCA TGGTTGTGGC AGCGCCAAGA GCGCTATGCT
GCGGCCTTTA ACGACCTCGC GGCGGCTCAG ACACTTGCCG ATGATCTGAA TTTGGTTTAT
TTGATTGCCG CATGTGCCAA AAATACAGGC TTGGCGTTAA GTCGTTTGGG GCGTTATGCT
GCCGCGTTGG ATTGGACACT AGATGCCCGT CGGCGCTATG GGCAGCTGGC GCGGCACGAT
TTTATCAATG GTTGCGAGCT GAATCTCGCC AATATTCTCT ATTATGCTGG CTTGTATGAG
CTTGCGCTTA AAATGTTCGA ACGGCTTGAG GGGAATTATC GTCGGCTAAA TTTGCTGCGG
ATGGCAGCTG GGAGTGTCCG CAATCAGGCA CTTGTATTGC GTGCTCAACA TCAAGCACAG
GCTGCCTATG ATCTATTGCG TTCGATTGAA GCGCCGATTT TGCAGGGTAC TAATCAATTA
GAACAAGCCG AATATTGGCA AGCGCTGGCT TTTGCACTTG CTGAGCAGGG TGATATTGAG
CAAGCAACCA CCATCTTCGG CCACGCCGAA AGCCTTTTTA AGCAACTTGA TAATCAACCT
GCTGCTGCAA AATGCCGTTT AGAGTTAGCT TGGCTGGCGC TTAAACAACA TCAATCAACA
GATGATTCTG ATAATGCCCA GATACACACA ATTAAACACC AGCTTCAGCA AACTTTGGGC
TATTTGGATG ATCGCCCATT CTATCGCTGG CGCATTTTGT ATGGGCTAGG TTTGTGCGAG
GCGGCCTTTG GTAAAACCGC AGCAGCCCTT GATTACTATA CCCAAGTTAG TCTTATTATC
ACCAATTTGC GCCTCGAATT TTTCGATGAA CATGCCTCAA GTGCGATCTT TGTGCAGGCC
CACGATCTTT TTACGGCGGC GCTTGATTTG ACTCTCCAAG CGAATAATCC GCTGGCCTTA
TTGCAATTGA GCGAACAACA ACGCGCCTTG GTATTACGCC AACATCTGGA TTCAGGGCGC
TCCAATCGTC GGATTCCGCG GGCGTTGCAG GCACAGTTGT GGCGTGGTTT GCGCACGGAT
AGTCTGGGTT TCTTGGATGA TCTGCCGCCT GTTGGCAAAC TTGAGATTCC AGAGATTCAA
TTTGACATCA CTGAAGCCTT TGATCAGCTT GATATTGCCC AGCTTCGGCA GCATTTAACG
GCGGTATATC AGGCGCGTTG GGTGGTGTTG GCCTATATTT GTCATGGCTC ACGGTTAATT
CGCTTAACCC TCACCCCAAC CATGCTGCAA GCTGATGTGA TTGTGCTTGA TGCTGGCTTG
AAACGCTTGA TCGAACGAGC AAGTTTGGCA CGATATCGCT GGTTTACCTA TAGTGGTCAG
CAAGCAAGCG ATCAGCCGAT TTGGCCTGAG CTTGAGTCGC TTGCGTCTCA CTTATTGCCC
CCAATTGTTT CACAACCCCT TGATTACTTG CTGATTGTGC CCGCTGAGCC GTTACATGGA
ATTGCATGGG GCACGTTACG ACTTAACCAA AAATGGTTGG CTCAGCATAC AGCGATTAAT
ATTCTGCCTA ATCTTAGCCA CATTCAACCA ACCGCCATGG TGGTCAAACC ATATAGTCAG
GGGGTTATGA TTGGGTGTAG CGAGTTTCGT GATGATTTGC CAGCGTTGCC AAATGTTGAA
CCTGAAATAG CCCAACTTGA AGCAATTTAT GCTGATCAGC CTACTCTGAC CCTCCAGAAT
GCGGCTGTAA CCAAAGCCCG AATCTTCGAG CTATCCAACG CCGGGAGCTT GCTAGGCTGT
CGATTTTTGC ATATTGCCAG TCATGCCCAG CTACGCTCTG GCGCAGGCCA AGCTGCCTAT
ATTCAATTGT GGGATGAGCG TTTAAGCTTC GATCAGATTA TTGATCTGCA ACTCCAAGGA
ACATGGGTTA TTTTATCGGT CTGTGATGGC TCGGCGAGCG ATGTTTTGGC GGGCGAAGAG
GTGTTGAGCT TGAGTCGGGC TTTTTTGGCA GCAGGAGCTA CGGCGGTAAT TGCCAACCTT
TGGAAGCAAG CAGATGATGC TTCGCCTCGC TTGATGGCCC GCTTGCATAG CTTGCTCCAT
GCTGGTAGCG ACCCAGTACG AGCACTATGT TTGCTCCAAA ATGATTGGCT TATGCACTAT
GATAACGCTT CTCCCTTAGT TTGGGGCGGG CTTCAAGTTA TTAGTAGTTT GGCCTAG
 
Protein sequence
MHAPERMLER QQRYQVADDH EFAWRQYCLG LANLRLQRLA ALQLALSQAR QAFGLSHDPL 
GLLYCEGLAL AGAWLAGTNV ENLQRLEGLA HRFRALGYPL EATRTDLLRV LHYYSLGQVE
RGLSLIQPLE ASIDQLGSRY DQAYWLWISG MTYSHNHDFN QAGRLLDQAF VLFSSLHMQP
ECARCLFDRS WLWQRQERYA AAFNDLAAAQ TLADDLNLVY LIAACAKNTG LALSRLGRYA
AALDWTLDAR RRYGQLARHD FINGCELNLA NILYYAGLYE LALKMFERLE GNYRRLNLLR
MAAGSVRNQA LVLRAQHQAQ AAYDLLRSIE APILQGTNQL EQAEYWQALA FALAEQGDIE
QATTIFGHAE SLFKQLDNQP AAAKCRLELA WLALKQHQST DDSDNAQIHT IKHQLQQTLG
YLDDRPFYRW RILYGLGLCE AAFGKTAAAL DYYTQVSLII TNLRLEFFDE HASSAIFVQA
HDLFTAALDL TLQANNPLAL LQLSEQQRAL VLRQHLDSGR SNRRIPRALQ AQLWRGLRTD
SLGFLDDLPP VGKLEIPEIQ FDITEAFDQL DIAQLRQHLT AVYQARWVVL AYICHGSRLI
RLTLTPTMLQ ADVIVLDAGL KRLIERASLA RYRWFTYSGQ QASDQPIWPE LESLASHLLP
PIVSQPLDYL LIVPAEPLHG IAWGTLRLNQ KWLAQHTAIN ILPNLSHIQP TAMVVKPYSQ
GVMIGCSEFR DDLPALPNVE PEIAQLEAIY ADQPTLTLQN AAVTKARIFE LSNAGSLLGC
RFLHIASHAQ LRSGAGQAAY IQLWDERLSF DQIIDLQLQG TWVILSVCDG SASDVLAGEE
VLSLSRAFLA AGATAVIANL WKQADDASPR LMARLHSLLH AGSDPVRALC LLQNDWLMHY
DNASPLVWGG LQVISSLA