Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1849 |
Symbol | |
ID | 5733738 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 2150435 |
End bp | 2151484 |
Gene Length | 1050 bp |
Protein Length | 349 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641278993 |
Product | extracellular HAF |
Protein accession | YP_001544620 |
Protein GI | 159898373 |
COG category | [S] Function unknown |
COG ID | [COG5563] Predicted integral membrane proteins containing uncharacterized repeats |
TIGRFAM ID | [TIGR02913] probable extracellular repeat, HAF family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTAGGTA GCAGCAAACG CATCGGTTGT GGATTAATTG TGATTGGATT ATTAGGTCTA ACACAAAGCC AATCGCGATT AATGCACGCC CAAACCCCCA CTCCACCCAA TCCAGGCTAC ACGTCGCGGA CTGTTGGCAC AATGGGCGGC CATGACGCTT ACGCACGGGA TATTAACGAT TTTGGGCGGA TTGCTGGCGG ATCTGAAACC GAATTATCCG ATACCCGAGC CTTTGTTTGG CGACGAGGAA CATTCACCGA GATTCCACCA TTAGGCGGTG GCGACAAAAG CTATGGTCAT GGTATCAACG ATAGTGGGTT CGTTGTTGGC GAAAGTAATA CCTATACCCA TGAACAACGC GCATTTTATT GGACTGGCGG CGATCCGATC GATCTCGGCA CGCTGGGAGG GCCAAGCAGT ATTGCTTTTG ATGTGAGTAA TGGCGAGCGA ATCGTGGGCA GGAGTAGCAC CAGCACTGGG GAAACCCATG CATTTATGTG GTATCGCAAT ACCATAACCG ATTTAGGAAC CCTAGGCGGC ACCTACAGTA CTGCACGCGA GATCAATGAT CACAAGGTCA TTGTGGGGAG TAGTACCAAC GCCAGCGGCG TAAACCGTGC ATGTATCTGG AAAAATGGGC AAATAATCGA CCTTAACATC CCAGCAGTCC AAAGTTATGG GGTTGCGATT AATAATAGCG AACAAGTTGT CGGCACGATG GTCATGACCA ATGGCATCTC GCGGGCCTTT ATATGGGACG ATGGCATTAC GACGATATTA AGCACAACAA CACCCTACAG CACCTTCGCC AACGACATTA ATAATTTAGG CGTGATCGTT GGCACAACCT ATGACACTGA TGGGTATTCC AAGGCTACGG TTTGGCGAAA TGGCACAGTG CTGTATATGT GGCCCGAGGA CACTGGCCTG ACTGAACATC AAACCAGTGC CGCCTCGATT AATGATGCAA ATCAAATTAC TGGCAGCGAT AGAATCAGTG ATTGGGATTA TTTCTCGACA GATATTATGC TTTGGCAGCT CACAGATTAA
|
Protein sequence | MLGSSKRIGC GLIVIGLLGL TQSQSRLMHA QTPTPPNPGY TSRTVGTMGG HDAYARDIND FGRIAGGSET ELSDTRAFVW RRGTFTEIPP LGGGDKSYGH GINDSGFVVG ESNTYTHEQR AFYWTGGDPI DLGTLGGPSS IAFDVSNGER IVGRSSTSTG ETHAFMWYRN TITDLGTLGG TYSTAREIND HKVIVGSSTN ASGVNRACIW KNGQIIDLNI PAVQSYGVAI NNSEQVVGTM VMTNGISRAF IWDDGITTIL STTTPYSTFA NDINNLGVIV GTTYDTDGYS KATVWRNGTV LYMWPEDTGL TEHQTSAASI NDANQITGSD RISDWDYFST DIMLWQLTD
|
| |