Gene Haur_0788 details


Gene Information

Locus tag: Haur_0788
Symbol:
ID: 5732672
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 889415
End bp: 891352
Gene Length: 1938 bp
Protein Length: 645 aa
Translation table: 11
GC content: 49%
IMG OID: 641277918
Product: LVIVD repeat-containing protein
Protein accession: YP_001543564
Protein GI: 159897317
COG category: [S] Function unknown
COG ID: [COG5276] Uncharacterized conserved protein
TIGRFAM ID:
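
The gene and protein lengths listed above follow from the start and end coordinates. The short sketch below reproduces that arithmetic. It assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the record does not state either convention, but both are consistent with the numbers it reports. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 889415
end_bp = 891352

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp)     # 1938
print(protein_length_aa)  # 645
```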


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0658316
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGGTTCGTC GTTTTTTTCG TTGTAGTTTG CTGCTGATGC TGGCCGTTGG AGCAATCGCT 
AGCCAGCATG CCACCCCGCT GGCCGCCCAA AGCCCTCAGT CAAGCCCAAG CCAATTGGCA
ATTCGTGGTC AACTTGGCGG TGATAGCTCA ACCATGGTCG TTGCCGAACA AGCTGCTTAT
GTTGGCATCG GTCCACGGGT CGTAACCTAC GATTTGCACA ATCCCAATCA GCCAAATCTA
GCCAACCAAA GCAGCCTTTT ACCCGCCACC GTGCAAGATT TGGCCTTAGG CGAGGGCTAT
TTGTACGCCG CCTTGGGCTA TGGCCGTGAA CATGGCTTGG CTACATTTTC CTTAGCCACT
CCATTAACAC CAACGCTTGT AAGTTTTTAC CCGCTTGAAT TATCGGCCAT GAGTGTTTTC
AGCTATCAAG ATTATGTCTA TGTGCAAGTT GATGATGGCT TGCAGGTGTT CGATTTGAGC
AATCCTAGCC AACCACAATT GGTCAATCAA CTTAATCAAT ATGGCATTGC CGCCGCAACG
GTTCAAGGCG ATCATGTGTA TCTCAATATT CAATCGTGGC TTTCGGGCTT CGATAGCTTG
AATATCGTCG ATTTGAGCAC ACCCATGACT CCAACGCTCA AAGGTTCGAT CGTGCTTGGC
GATATTTTAG ATATCGAAAT TGCCGCTAAT TATGCGTATG TCAATACTCA AATGGGCCTG
CGGGTGGTTG ATATCAGCAA TGGCAGCAAC CCAAACATTA TCAACCAAAC CGATGATGTT
GCTGGATTAC AACTAACCGC AAGTGGCAAC ACACTCACAA TGATTGGGCT AGATTATGCC
TTACACACGT GGGATATTAC CAACCCAATC ACGCCGACCG AAATTGCTGC CAGCTCCGAA
ACGGTGCTTG GCTGGCCAAC AGCCTTAATT GCCCAACAAA ACCTTGGCTA TGTGCTGACT
CGTTCAGGCC AAATTAATGT CTTCGATTGG AGCACTCGCC AAGCTCCCAC GCGCATCAGT
GCGACTGAAA CCCCTGGCTG CAAGGAAAAT GTAGCCTTGC AAGTGGTTGG CGATTATGCC
TATGTGGCCG ATTGGGGCCG TGGTTTGTGC ATTGTTGATG TGCGCAATCC CCAACAACCA
AGCGTGATTG GCCGCTTCGA TTTAACCACC GATCGCAACG CCGTCACCGA TATTGCCGTC
CAAGGCTCGT TGGTATATAT GGTCGATTGG AGCCACGGAA TTTATGTGAT CGATGTCAGC
GACCCAACCG AACCAACTCA AGTCAGTTTC TTTGCAACCG CTGGCTTTCC AGCAGCAATT
GCGGTCAATG GCCCGATGGT CTATGTTGGC GAATCGATCA ACGATCAAGG CGTTGGTGGC
AGTGTACGGA TTTTCGATCT CAGCGATTTG GCCAATCCAG TTGAGCATAG TCAGTTCTAC
ACCAGCAAAG TCGCCCAATT AGCCGTGATC GATCAAACCG TGTATGTGGC CGACAACGAA
GGCGGCCTGC GGATTCTCGA TGTTAGCAAT CCTGATAACA TTCAGATGAT CGGCCAATAT
GCTGAAAATT GGTTCGCCAT GGATTTAGCG ATTCAGGATG AGTATGTGTA TTTGGCGACG
ACTAGCGGCT TAGAAATTGT CGATATCAGC AACCCCAGCC AACCAATCCC AGCAGGCCGC
CTTGGCGATC GCTTGATTGA TGGCATCGTT GTGGCGGGCG ATAATGCGTA TATTGGTGAA
AATGGTAGCA TTCGCCAAAT CGATATCAGC CAACCGAGCA ACCCTCAAGT TGTAGCCGAA
ACCCTGCTAG CTCGCGATTC ATGGATTCTA CCAGCGCTGC ATGGTGGAGA GATTGTAGGC
TTGAGCTATC ATGGTGGTGG CATGTTCGTA CTTGAGCCAG AATATCAATT ATTTTTGCCT
GTAATCAGCA AAAACTAA
 
Protein sequence
MVRRFFRCSL LLMLAVGAIA SQHATPLAAQ SPQSSPSQLA IRGQLGGDSS TMVVAEQAAY 
VGIGPRVVTY DLHNPNQPNL ANQSSLLPAT VQDLALGEGY LYAALGYGRE HGLATFSLAT
PLTPTLVSFY PLELSAMSVF SYQDYVYVQV DDGLQVFDLS NPSQPQLVNQ LNQYGIAAAT
VQGDHVYLNI QSWLSGFDSL NIVDLSTPMT PTLKGSIVLG DILDIEIAAN YAYVNTQMGL
RVVDISNGSN PNIINQTDDV AGLQLTASGN TLTMIGLDYA LHTWDITNPI TPTEIAASSE
TVLGWPTALI AQQNLGYVLT RSGQINVFDW STRQAPTRIS ATETPGCKEN VALQVVGDYA
YVADWGRGLC IVDVRNPQQP SVIGRFDLTT DRNAVTDIAV QGSLVYMVDW SHGIYVIDVS
DPTEPTQVSF FATAGFPAAI AVNGPMVYVG ESINDQGVGG SVRIFDLSDL ANPVEHSQFY
TSKVAQLAVI DQTVYVADNE GGLRILDVSN PDNIQMIGQY AENWFAMDLA IQDEYVYLAT
TSGLEIVDIS NPSQPIPAGR LGDRLIDGIV VAGDNAYIGE NGSIRQIDIS QPSNPQVVAE
TLLARDSWIL PALHGGEIVG LSYHGGGMFV LEPEYQLFLP VISKN
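
As a rough illustration of how the protein sequence relates to the gene sequence under translation table 11, the sketch below translates the first 60 bases of the gene. It is a minimal example, not the database's own pipeline. The `translate` helper is hypothetical, the codon-to-amino-acid mapping is the standard one (which table 11 shares), and the set of alternative start codons is an assumption based on the NCBI description of table 11. An initiator codon such as GTG is read as methionine only at the first position.

```python
from itertools import product

# Standard codon mapping in TCAG order; table 11 uses the same amino acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: initiator codons allowed by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(dna):
    """Translate a DNA string, ignoring whitespace, until the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiator codon is read as methionine
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":
            break  # stop codon ends translation
        protein.append(aa)
    return "".join(protein)


# First 60 bases of the gene sequence, copied from the record above.
first_line = "GTGGTTCGTC GTTTTTTTCG TTGTAGTTTG CTGCTGATGC TGGCCGTTGG AGCAATCGCT"
print(translate(first_line))  # MVRRFFRCSLLLMLAVGAIA
```

The output matches the first 20 residues of the protein sequence listed above, including the leading M that comes from the GTG start codon.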