Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1819 |
Symbol | |
ID | 5733677 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 2116902 |
End bp | 2117924 |
Gene Length | 1023 bp |
Protein Length | 340 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641278962 |
Product | hypothetical protein |
Protein accession | YP_001544590 |
Protein GI | 159898343 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00974885 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCCAAGTA AAATTGACCC TAATGATTGG GATGCAAACT TTTCGGCCAA GGGGCCGCGT CAACAAGGCC CACTCGGTGT TTTACTCCAA ACCTTGTTAG TGCTTGGGAT TGCTGGTGGA TTAGGCTATG GGGTTTGGTT GCTTAATGAA ACTATGAGTG AGCAAAATGC TGCAAATGCT GCTACGGCTA CGGCAGTTGC TCCAACGATA TTTGCGCGGG CCACCCAACG CTCAATCCAA GCAACCCAAG ATGCGATTCC AACCGCGACC CCTAACTTGC CAATTGGGCG TTCGCTGGCA ACCTCGGGGC TACGATTTGA GCCAAATGAA CGCTCTCAAT CCAAAGGTAA TGTCAATCCA AATGATAGCC TGCAATATAT CGAATCACGT GAGGTTGGTG GGGTGCTTTG GTGGAATGTG CGCTTGTTTG AACGAGCAAA CATGGATTTG CCCGCAGCCG AGTTAGGTAC CAGTGGTTGG GTTTTGGCTA CGACTGTGAC TGAACCAAGC GCACCTCCGC CGGTTGTCGC CGATTCAACC GCTGTTCCAG CTGGCCCTGT CGATTTACCA ACTACGCCAA TTGATCCAAG TAGCGTGATT TTAACCCGCA ACGCGAACAC CAATATGAGT GTTTCACACC CCCAAGCCTG GACAGAATAT GTGATTGGCG AAAACCTTGC TGTTTTTTTC TCAACCGCAG TCGATGGCGA GCAAGGCATC TTGGCGAATA AAGTGATTGG TAAAAACAAC GATGTAGCTG GCATTCAAGC GGCTATCGAA GAGACCATAA ATATCCTAGC AGCACCCAAT AGCCAACGTG AATATGCTAT TAATGCGTCG GAAGGAACGG TTAAGTTTGT GCGGGTTGTG CGTGGTCAGG AGGCTAAAAT TCAGGCGTTC GTTACGGTTA AGCCTGGCCC AGGTGGTAGT TTAGGCGTGA TTGTCGGCTT TGTGCCCGAA TCAGGCTTTG CTGCCAACCA AGCTATTTTG CAACAAATGA CCCGTAGCGC AATTATTCAA TAA
|
Protein sequence | MPSKIDPNDW DANFSAKGPR QQGPLGVLLQ TLLVLGIAGG LGYGVWLLNE TMSEQNAANA ATATAVAPTI FARATQRSIQ ATQDAIPTAT PNLPIGRSLA TSGLRFEPNE RSQSKGNVNP NDSLQYIESR EVGGVLWWNV RLFERANMDL PAAELGTSGW VLATTVTEPS APPPVVADST AVPAGPVDLP TTPIDPSSVI LTRNANTNMS VSHPQAWTEY VIGENLAVFF STAVDGEQGI LANKVIGKNN DVAGIQAAIE ETINILAAPN SQREYAINAS EGTVKFVRVV RGQEAKIQAF VTVKPGPGGS LGVIVGFVPE SGFAANQAIL QQMTRSAIIQ
|
| |