Gene Haur_0935 details

Gene Information

Locus tag: Haur_0935
Symbol:
ID: 5732821
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 1067836
End bp: 1070808
Gene Length: 2973 bp
Protein Length: 990 aa
Translation table: 11
GC content: 53%
IMG OID: 641278067
Product: kelch repeat-containing protein
Protein accession: YP_001543711
Protein GI: 159897464
COG category: [S] Function unknown
COG ID: [COG3055] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
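
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The following is a minimal Python sketch, not part of the original record: the variable names are hypothetical, and it assumes that the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length.

```python
# Values copied from the Gene Information record.
start_bp = 1067836
end_bp = 1070808
gene_length_bp = 2973
protein_length_aa = 990

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not part of the protein.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_aa = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", remainder == 0)
print("protein length matches:", expected_protein_aa == protein_length_aa)
```

Under those assumptions, 2973 bp corresponds to 991 codons, that is, 990 amino acids plus the stop codon.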


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.35348
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTCTTTT TCGCTCGTTC AAGGGATTTG TTGCCCCGCT TGCTGAGCGG GTTGCTGCTG 
CTGACCTTGA TCATTACGGG GTTGTTGGTG CAGTCGCAGG CCCAAGCTCA AACTGCTCAA
CCACAGGCTG TTGCCACTAG TGCCACAACG ACGATGTTTA GTTGGCAAGA TGGCGCAACC
GTGCCGCAAC CACGTTTTGA ATCGCAAGGG GTCTTTGTCA ATGGCAAGTT GTATGTAATT
GGTGGCTTTA TTAGTTGTTG TACCCAAATT AATGCTACCG ATTTGGTCGA TGTTTATGAT
TTGGCCAGTA ATACGTGGCA ACGGATCGCC AGTATTCCCG AGGCGATTAG CCATGCCCCA
GTTGTTGCCG ATGGCACCAA TATCTATGTG CTCGGCGGGT ATTTGGGCAA TAACCCAGGT
GGTAGCACCA ACCATGTGTG GAAGTTGAAT ACAGTTACTA ATACATGGAC GCGTGGTATC
GACTTGCCCG TTGCTCGCGG TGGCGCTGGT GCAGCGATTG TTAATCGTAA AATTTATTTC
TTCGGTGGGG CTGTGCGTAC CGCTGGTGTG TTTGATGATA CTGACTTCGG CGATCATTAT
ATGCTCGATT TAAGCCTCGC TAACCCAACT TGGGTTTCAC GCGCTGCCAT GCCCAACCCA
CGTAACCATA CTGCTGCTGG GGTGGTCGAT GGCAAAATTT ACGCCGTTGG TGGCCAGCAT
GGCAAAGCCG AAGAATCGGC CAATCAGGCC GAGGTTGATC GCTATGATCC CGCAACCAAT
ACGTGGACGC GGGTTGCCGA TATGCCCATT CCCAAAGGCC ATACTTCTTC ATCAACCTTT
GGCTATCGTG GCCGTTTATT GGTGATTGGT GGCTCGATCA ATGGTGGTAC CAGCGGCCTT
GCTTCGGCTG ATGTGCTGAT GTACGACCCC AAAAGCGATG TTTGGATGAA GTTGGTTTCG
TTGCCAGCCT ATCGTAAAAC CCCAGTTGCC GATGTTTGGA ATAACAAACT TGTGGTGACG
ACTGGCGGCG GTTATGGCCA AACGGATACC ACTTGGATTG GCAATTTGCC CGATTCATGG
GAATCTAACG GCACCATGCC CGTACCCTTG GGCGAGGTTG CCAGTGGGAT TATCGGCAAT
AAAATGTATG TGGTTGGTCA AGGCAATGCT GCAACCTTGC GCTACAACTT GACGACTGGC
AGCTATAGCT CAACCACCAC TTTGGCTCAA CGGGCCTTGC GTGGCAATCA CCATGCCGCC
GAAGTAATAA ACGGTAAATT CTACTTATTT GGTGGCCTCG ATAATAACAG TGATGGCAAA
GTCCAAATCT ATGATCCTGT GTCCAACACC TGGGCCATGG GCGCGGATAT GCCGTTTGCC
GCTGGGGCTA GCTCCTCAGC CGTGATCAAT GGCTATGCCT ATGTTGCTGG CGGGATTATC
AATGGCAGTA CAACCACCCG TGTAGCCCGC TATGATCCAG TCGCGAATAC TTGGACTGAA
GTTGCCGCCA TGCCGCGCGG GCGCAATCAC GCTGCTTCGG CGACTGATGG TAGCAAATTG
TATGTCTTTG GTGGGCGTGG GCCAGGTAGT GGCGATGGCA ATAATGTTGC CAATGGCTTT
GATACTGTCC AAATTTACGA TCCTGTTGCT AATTCATGGC GCTCAAGCAT GACCGATACA
ACGATTGCGC CGTTGCCGCA AGCCCGTGGT GGGATGGGCA AAGCGGTTTA TTATGACGGC
GAGTTCTATA TTTTTGGCGG TGAAACGCTT GATGGTGCAG GCGCGGTCAC TGGCAATGTG
TATGATCGGG TTGATATTTA TAACCCAATT CTTAACCGCT GGCGCACTGG TTCGCCTATG
CCGACCGCCC GCCATGGCAT CTTCCCTGTG ATTAACGGCG GACGAATTTA TATTGCTGGG
GGTGGCACGG TTTCGGGCTT TAGCCAAACC AACATCACCG AAATTTATAA TCCAGGGATT
GCTGCCGACG AAAATGCCCA CCTTGAAAGC AATGGTCAAG TTGGCTTCGA TTCCGAGCAA
GTCCACGGCA ATACCGCGCG TGGTCAGTGG TCGTGGCAAA CTCGCAATGA TGTTGATCGG
CCTGGCTTCA GCGGCAACAA CTACTTGGCG GTCTTGCCCA ATACTGGCAC CCTTTCGGAT
ACCAATTACA CCACGATGGC TCCACAACTG GATTATCGCG TGAAGTTCAC CACGCCTGGT
ACCTATTATG TGTGGCTGCG TGGCTGGGCC GATAGTGGCG GCGATAATTC GGTGCACCTT
GGCCTGAATG GTCAGCAATC AAGCTCATCG GATAAGATGA CCTTGCCAAT TTTCAGCGCT
TGGCGTTGGT TCCGCGATAC AACCGATGGT GTGCCTTCAA CTATCACTAT TCCAAGTGCT
GGAGTATACA CGCTGAATTT GTGGATGCGT GAGGATGGTT TCCGTGTTGA TCGCTTGTAT
CTGACGACTG ATGTTAATTT TGTGCCAAGC GATGCCCAAT TAACCCCAAT TCCGACGACT
GCACCAACCG CGACGGTGAG CAATCCGACC GCGACCAATA CCGCGACGGC AACCAATACA
CCAACCAACA CACCAACCAC CGTGCCGCCA ACGGCGACCG ATACCGCTAC GGCGACCGAG
GTTCCAACTA ATACGCCAAC CGCCACGACG GTTCCGCCAA CGGCAACCGA TACTCCGACT
TCAACCAATA CGCCTGAACC GTCAACCACC GAGCCAGCGA CCAACACGCC GACTCCGACG
ATTACGGTGA CGGCAACCAA TACAACCACG GCTGTGCCGA CAACAGCAGT GCCGACAACA
GCATTGCCAA CCACGGCTGT GCCAACGGTA ACTGAGACGC AAACTCCGAC CGCAACTGTC
ACAGTAGGCA CGATTACGCC AAGTGAAACG CCAACCGCGA CAGTCCCGAC CAGCGAAACA
CATCAAGTCT ATCTACCTTG GGCTTCTAAA TAA
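
The gene sequence is printed in blocks of ten bases separated by spaces, so the blocks have to be joined before any computation such as a GC fraction. A minimal Python sketch of that clean-up follows. The helper names `join_blocks` and `gc_fraction` are hypothetical, and the demo string is only the first line of the sequence, not the full gene, so its GC fraction is not expected to equal the whole-gene value in the record.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; returns 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# Demo input: the first line of the gene sequence only (illustrative).
first_line = "ATGGTCTTTT TCGCTCGTTC AAGGGATTTG TTGCCCCGCT TGCTGAGCGG GTTGCTGCTG"
seq = join_blocks(first_line)
print(len(seq), round(gc_fraction(seq), 3))
```

To process the whole gene, pass all sequence lines as one string to `join_blocks`; the joined length should then equal the gene length given in the record.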
 
Protein sequence
MVFFARSRDL LPRLLSGLLL LTLIITGLLV QSQAQAQTAQ PQAVATSATT TMFSWQDGAT 
VPQPRFESQG VFVNGKLYVI GGFISCCTQI NATDLVDVYD LASNTWQRIA SIPEAISHAP
VVADGTNIYV LGGYLGNNPG GSTNHVWKLN TVTNTWTRGI DLPVARGGAG AAIVNRKIYF
FGGAVRTAGV FDDTDFGDHY MLDLSLANPT WVSRAAMPNP RNHTAAGVVD GKIYAVGGQH
GKAEESANQA EVDRYDPATN TWTRVADMPI PKGHTSSSTF GYRGRLLVIG GSINGGTSGL
ASADVLMYDP KSDVWMKLVS LPAYRKTPVA DVWNNKLVVT TGGGYGQTDT TWIGNLPDSW
ESNGTMPVPL GEVASGIIGN KMYVVGQGNA ATLRYNLTTG SYSSTTTLAQ RALRGNHHAA
EVINGKFYLF GGLDNNSDGK VQIYDPVSNT WAMGADMPFA AGASSSAVIN GYAYVAGGII
NGSTTTRVAR YDPVANTWTE VAAMPRGRNH AASATDGSKL YVFGGRGPGS GDGNNVANGF
DTVQIYDPVA NSWRSSMTDT TIAPLPQARG GMGKAVYYDG EFYIFGGETL DGAGAVTGNV
YDRVDIYNPI LNRWRTGSPM PTARHGIFPV INGGRIYIAG GGTVSGFSQT NITEIYNPGI
AADENAHLES NGQVGFDSEQ VHGNTARGQW SWQTRNDVDR PGFSGNNYLA VLPNTGTLSD
TNYTTMAPQL DYRVKFTTPG TYYVWLRGWA DSGGDNSVHL GLNGQQSSSS DKMTLPIFSA
WRWFRDTTDG VPSTITIPSA GVYTLNLWMR EDGFRVDRLY LTTDVNFVPS DAQLTPIPTT
APTATVSNPT ATNTATATNT PTNTPTTVPP TATDTATATE VPTNTPTATT VPPTATDTPT
STNTPEPSTT EPATNTPTPT ITVTATNTTT AVPTTAVPTT ALPTTAVPTV TETQTPTATV
TVGTITPSET PTATVPTSET HQVYLPWASK