Gene Haur_1229 details


Gene Information

Locus tag: Haur_1229
Symbol:
ID: 5733122
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 1422206
End bp: 1424233
Gene Length: 2028 bp
Protein Length: 675 aa
Translation table: 11
GC content: 50%
IMG OID: 641278369
Product: PKD domain-containing protein
Protein accession: YP_001544005
Protein GI: 159897758
COG category: [S] Function unknown
COG ID: [COG5276] Uncharacterized conserved protein
TIGRFAM ID:
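
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are chosen here and are not part of the record.

```python
# Values copied from the record above.
start_bp = 1422206
end_bp = 1424233
protein_length_aa = 675

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the unspliced CDS ends in one untranslated stop codon.
codon_count = gene_length_bp // 3
expected_protein_length = codon_count - 1

print(gene_length_bp)           # 2028
print(gene_length_bp % 3)       # 0, the length is a whole number of codons
print(expected_protein_length)  # 675
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.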


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCACACAC ACAATGGACG TTGGTCGCGT TCGTTAGCGG TCGGTTTTGC GGTGTTGGCG 
GCGTTTGTGT TAGCGCCAAT TGCTTCGATT TCGGCCCATA GCAATCATGA TCATAAGATT
TATGTTGCAC CATTTGGCAA AGATGAGGGT GATTGTTCGA AGATTTGGCA GCCCTGTGCG
ACGATCGATT ATGCGATTAG CCGCGGGACT GGCAAGGGTG ATGAAATTCG GGTTGCTGCT
GGCGATTATA GCTACGATGC TCAGGCGGTT GGCTTATTGC TGAGCCGCCA GATTCCGGTG
CACGGTGGTT TTAGCGCTCG TGATAATTTC ATGCAGCAGG ATACCAAGGC CAATCCAACC
TATATCGCTG GGATTCCTAG CCGCTACCGC GAACGCTTGG CGGCAATTGG CTTTGCGCGG
GTCGAAGATC GCGATGGTCG CAATACCCCA GAACTTGAGC GGATGCAAAT TCCAACTGAA
TTGGAAACGC CAAGTGCTGC TCAGCCTTGT ACCAATGGTA TGGCTGGCAT CTTTGAGTGC
AACGGCATCG ACTTCTTGGG CAGCGTGCCG CTCTCAAGTT TTCGGGTAAA TGGTAGTGGA
GCCACCGATG GCAGCAACTT GTGGGGCCAC GTCGATCTCA ATACCGATCG CGAATATGCG
ATTATTGGGG TTAACAATGG CACTGGCGTA GTTGATGTGA CTGATCCCAC CAATCCGGTG
GTGATTGGCA CGGTGCCAGG TAATAATTCG CAATGGCGCG AAGTTAAGGT CTATCAATAT
TTCAACCAAG CCCAAAATCG CTGGAACGCC TATGCCTATC TTTCAACCGA AAATCGTACC
CAAGGGTTGC AGATCGTTGA TCTCAATCAC TTGAGTGATC CAACGCCTTC GGTTAGTTTG
GCGGCAACCT ATACCGCCGA TTTTGGCTCA TCGCACACTG TCTATATTAA GAATGTTGAT
TTCAGCACCA ACGTAGCGTT GCCTGGTAAA ACTGCGGCGT TGTATATGAA CGGGGTCGGC
AAAAACGGCA GTCGCTTGAG TTCAGGTGTG TTCCGCGCCT TTGATATCAG CAATCCGTTG
ACTCCGCAGA TTATTGATAG TGGCGTGCCG GATACCAATA TTAGCTACAC CCACGACGAT
ACCAGCATGA TCATTACCGA TTCGCGGACT TCGGCATGTG CGCCTGGTCA CCAAGCCTCG
TGCGAAATAA TGTTTGATTT CAGCGAATCG TCGGTCGAAA TTTGGGATAT TACCGATTCG
GCTGCCCCAT TCCATATTAG TTCACGGCCT TATTCGGGCA GTGGCTACAC CCACTCAGGC
TGGTATAGCG ACGATAAAAT GTACGTCTTT ATCCAAGATG AATTGGATGA ACAAAATTTT
GGCCACAACA CTCGTGTGCG CACGATGGAT ATTCATGATC TGGATAATCC AACCATCAGC
GCGACGTGGG ATGGCCCAAC TCGGGCAATT GACCATAACG GCTACACGAT TGGCAACAAA
TATTATATGT CGAACTATTT GCGTGGTTTG ACGATTCTTG ACATAACCAA TCCAAATAAC
ATCCAAGAAG CCGCTTTCTT TGATACCTAT CCTGGTAGCA ATTCGGCCAG TTTTGATGGA
GCTTGGGGGG TTTATCCCTA CTTGCCAAGC GGCACCTTGA TGATCAGCGA TATTTCACGC
GGCTTGATTT TGGTGCGCGA ACCAACTAGT ACCCCTGATC AAGCGATTGC TGGCTTGGAA
GCAACCAACG ATGGCCCAAC CGTTGCTGGC GAGGCGACCA ATTTTGATGC AGCAATTCGG
GCTGGTACAA ACGTGACCTA TGCGTGGGAT TTTGGTGATG GTACAGCTGT GGTAACTTCA
ACCAACACCA CGATGAGTCA TACCTATCCA AACGTTGGTA ATTATACGGT TGAGTTAACT
GCCAGCAATG GTACTAATTC GCAAACAGCT ACAACAACCG TGGTGGTGCA AGCGCCACCA
CAAACCGAAT GGAAAATTTG GCTGCCCTTT GCGATTCGGG CGGAATAA
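
The gene sequence is displayed in space-separated blocks of 10 bases, so it has to be joined before its length or GC content can be recomputed. The following is a minimal sketch under that assumption; `clean_sequence` and `gc_percent` are hypothetical helper names, and only the first display line is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Join display blocks by removing all spaces and line breaks."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in dna."""
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return 100.0 * gc_count / len(dna)


# First display line of the gene sequence above.
first_line = "ATGCACACAC ACAATGGACG TTGGTCGCGT TCGTTAGCGG TCGGTTTTGC GGTGTTGGCG"
dna = clean_sequence(first_line)

print(len(dna))          # 60
print(gc_percent(dna))   # GC percentage of this one line only
```

Passing the full sequence text to the same two functions would give values to compare with the Gene Length and GC content fields.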
 
Protein sequence
MHTHNGRWSR SLAVGFAVLA AFVLAPIASI SAHSNHDHKI YVAPFGKDEG DCSKIWQPCA 
TIDYAISRGT GKGDEIRVAA GDYSYDAQAV GLLLSRQIPV HGGFSARDNF MQQDTKANPT
YIAGIPSRYR ERLAAIGFAR VEDRDGRNTP ELERMQIPTE LETPSAAQPC TNGMAGIFEC
NGIDFLGSVP LSSFRVNGSG ATDGSNLWGH VDLNTDREYA IIGVNNGTGV VDVTDPTNPV
VIGTVPGNNS QWREVKVYQY FNQAQNRWNA YAYLSTENRT QGLQIVDLNH LSDPTPSVSL
AATYTADFGS SHTVYIKNVD FSTNVALPGK TAALYMNGVG KNGSRLSSGV FRAFDISNPL
TPQIIDSGVP DTNISYTHDD TSMIITDSRT SACAPGHQAS CEIMFDFSES SVEIWDITDS
AAPFHISSRP YSGSGYTHSG WYSDDKMYVF IQDELDEQNF GHNTRVRTMD IHDLDNPTIS
ATWDGPTRAI DHNGYTIGNK YYMSNYLRGL TILDITNPNN IQEAAFFDTY PGSNSASFDG
AWGVYPYLPS GTLMISDISR GLILVREPTS TPDQAIAGLE ATNDGPTVAG EATNFDAAIR
AGTNVTYAWD FGDGTAVVTS TNTTMSHTYP NVGNYTVELT ASNGTNSQTA TTTVVVQAPP
QTEWKIWLPF AIRAE
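
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below is an illustration, not the database's own method. It assumes the sense-codon assignments of translation table 11, which match the standard genetic code, and it does not handle the alternative start codons that table 11 permits (this gene starts with ATG, so that does not matter here). `translate` is a hypothetical helper name.

```python
from itertools import product

# Codon table in the conventional TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate dna in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three display blocks of the gene sequence (30 bases, 10 codons).
print(translate("ATGCACACAC ACAATGGACG TTGGTCGCGT"))  # MHTHNGRWSR
```

The output matches the first block of the protein sequence above.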