Gene Haur_1981 details


Gene Information

Locus tag: Haur_1981
Symbol:
ID: 5733870
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 2435772
End bp: 2436806
Gene Length: 1035 bp
Protein Length: 344 aa
Translation table: 11
GC content: 48%
IMG OID: 641279125
Product: extracellular HAF
Protein accession: YP_001544752
Protein GI: 159898505
COG category: [S] Function unknown
COG ID: [COG5563] Predicted integral membrane proteins containing uncharacterized repeats
TIGRFAM ID: [TIGR02913] probable extracellular repeat, HAF family
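
The start and end coordinates, the gene length, and the protein length listed above are mutually consistent. The following minimal sketch checks that arithmetic. It assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The function names are illustrative only and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(2435772, 2436806)
print(length_bp)                  # 1035
print(protein_length(length_bp))  # 344
```

Both values agree with the Gene Length and Protein Length fields of the record.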


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGTTCAT TTAAAACTTT TTGGTTGCTG CTGCTCGTCG TTATGGCACT TGGCTGGTTT 
GTTGTTCAGC GCTCGCCGCT TCAGGCCCAA ACGCCAAATC CCAACTATAG TTATGTCAAT
CTTGGGGCAC TAGGCGGGCA ACATATGTAT CCTAGTGATA TTAATGATTT TGGACGAATC
GCTGGGAGTG TAGAAACCGA GTTCTCAGCA ATGCGAGCGT TCGTTTGGCG ACGAGGTACG
CTCAGCAATC TGGGCACACT CGGCGGCAAT CAAAGCTATG GCTATGGCAT CAATGATACT
GGCTATGTTG TCGGTGAAAG CACTACGAGT AATAACAAAC GGCAGGCTTT TTATTGGCGC
GAAGAGCAAA TGCTCAATCT TGGCACCCTC GGTGGTAATG TTAGCACAGC GCTTGATGTC
AGCAATGGCG AGCGGATCGT TGGCCGAAGC ACGACCAGTA CTGGCGATAC CCATGCATTT
ATGTGGTATC GCAATACGAT GACCGATCTT GGTACGCTGG GGGGCAACTA CAGCACCGCC
AATGAAATCA ACGATCACAA AGTTATTGTC GGTTGGAGCA CCAATGCCAA CGGTGAAACT
CGCGCCTGTA TCTGGAAAAA CGGTACGATT ATCGATCTAG GCATACCTGC GGTTAAAAGT
TATGGCTATG CAATCAATAA CAATGAGCAA GTTGTGGGAA TGATGGAATT AAGTGATGGT
CAACGCCATG CATTTCTTTG GGAGAATGGC GTAACCACCG ATTTAAGCGC CGGATTGAAT
CAATATAGTG GTGCAAATGA TATTAACGAT GCAGGCACAA TCGTTGGGTT TACTGGTGAC
GACACAACAC CACTTGCTGC AACGGTTTGG CATAATGGCA CACGTTTGCG GATGGGGCCA
TTCAGTCAAG CAAGCACCGA ATATCAAACG ATTGCAACTG CGATTAATGA GGCCAACCAA
ATTGCTGGTT ATGCTATCGT GAGCGCTGAT GGCGTTACGC GCACCGACGG AATAATTTGG
CAATTTGAAG ATTAA
 
Protein sequence
MRSFKTFWLL LLVVMALGWF VVQRSPLQAQ TPNPNYSYVN LGALGGQHMY PSDINDFGRI 
AGSVETEFSA MRAFVWRRGT LSNLGTLGGN QSYGYGINDT GYVVGESTTS NNKRQAFYWR
EEQMLNLGTL GGNVSTALDV SNGERIVGRS TTSTGDTHAF MWYRNTMTDL GTLGGNYSTA
NEINDHKVIV GWSTNANGET RACIWKNGTI IDLGIPAVKS YGYAINNNEQ VVGMMELSDG
QRHAFLWENG VTTDLSAGLN QYSGANDIND AGTIVGFTGD DTTPLAATVW HNGTRLRMGP
FSQASTEYQT IATAINEANQ IAGYAIVSAD GVTRTDGIIW QFED
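
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It is a simplified illustration, not the method used to generate this record. Translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in which codons may act as start codons), so the standard codon layout is used here. The helper name `translate` is hypothetical, and the sketch does not handle alternative start codons, which is sufficient for this gene because it starts with ATG.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 15 bases of the gene sequence in this record.
print(translate("ATGCGTTCAT TTAAAACTTT TTGGTTGCTG"))  # MRSFKTFWLL
print(translate("CAATTTGAAG ATTAA"))                  # QFED
```

The two fragments reproduce the first ten and last four residues of the protein sequence shown above.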