Gene Information |
Locus tag | Haur_3861 |
Symbol | |
ID | 5735740 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 4849564 |
End bp | 4850613 |
Gene Length | 1050 bp |
Protein Length | 349 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 641281012 |
Product | hypothetical protein |
Protein accession | YP_001546623 |
Protein GI | 159900376 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
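The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is only an illustration: it assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 4849564
END_BP = 4850613
GENE_LENGTH_BP = 1050
PROTEIN_LENGTH_AA = 349


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Number of amino-acid codons, assuming one trailing stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions, 4850613 - 4849564 + 1 gives 1050 bp, and 1050 / 3 - 1 gives 349 aa, matching the Gene Length and Protein Length rows.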
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATCGTT CAAGCTCACT ATTCACATTA TTAATCGTGG CAATCTTATT GATTCCAACA TCATCACAAG CAACCACGAG TACCTCAACT CGCCGCTTTA TCGATCAGCC TTGGCTGAAT CTTTATTATA CAAATTTGGT ATGGGAAGAT ACGGGTAAAT GTCGGGCGAT GTTCCGCGTG AAAACTACGG ATTTATGCAC CCATGGCCCT GATTCGCCTG AACCTCAAAT GGCTAAAAGT TGGGCTGAAG AATTATTTTA TCCTAACAAT CAAATGCATT TGTCAATAGA GCAATCATCC CAACCTTGCC TTGACGATAG TACCAACGAC TATCGAACTC AAGTTTTATA TGTGCGGGCT AGCGATGCCC AAGATAATAG CTCGATCATC GCGCCCGTTT TACGCAAGCA TTTATTGGGT GTAAACGAAA TCTATAACCA AAGTGCCCAA TTAACTGGTG GTCAGCGCCA TATTCGTTGG CAACTTAACC AAGCCTGCCA AATTGACGTG CAGCATGTGG TTATTCCGCC AGCAGCCGAT GATGATTTTG GCGCAACGAT CAATGCCGTG GTGGCCCAAG GCTTCAATCG TGATGATCGT AAATATGTGA TGTTTGTTGA AGCCCGCTAT TACTGTGGGA TTGCCACGGT GCAGCACGAT ACGCAACCAA GCAGTGCCAA TTTAAATAAC CATAGCGTAG CCTATGCCCG GATTGATGCT GGCTGTTGGA ATGCTCGCGT AGTCGCTCAC GAACATATGC ATATGATTGG TGGAGTTCAG CCAAGTGCGC CGCATACCAC CAAGAGTTGG CATTGCACCG ATGAATGGGA CATTATGTGC TACTCCGATG CACCGTCGTA TCCAGATTTA GAAATCGCCT GCCCTGAGCA TAGCATTCGT GATATTTTCG ATTGTGGCAA TGATGATTAT TTTCATACCA ACCCAGCCGC CGATAATTAC CTTGCCAGCA ATTGGAATAG CGCCAATAGC CTATTTTTGA GCCAACAAAG CCAATATTGG CGGATCTATC TGCCGATTGC CAGCAATTGA
|
Protein sequence | MNRSSSLFTL LIVAILLIPT SSQATTSTST RRFIDQPWLN LYYTNLVWED TGKCRAMFRV KTTDLCTHGP DSPEPQMAKS WAEELFYPNN QMHLSIEQSS QPCLDDSTND YRTQVLYVRA SDAQDNSSII APVLRKHLLG VNEIYNQSAQ LTGGQRHIRW QLNQACQIDV QHVVIPPAAD DDFGATINAV VAQGFNRDDR KYVMFVEARY YCGIATVQHD TQPSSANLNN HSVAYARIDA GCWNARVVAH EHMHMIGGVQ PSAPHTTKSW HCTDEWDIMC YSDAPSYPDL EIACPEHSIR DIFDCGNDDY FHTNPAADNY LASNWNSANS LFLSQQSQYW RIYLPIASN
|
| |
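The gene and protein sequences above are displayed in space-separated blocks of ten characters. The following minimal sketch shows one way to strip that formatting, compute a GC fraction, and translate the coding sequence. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ only in permitted start codons). The helper names `clean`, `gc_content`, and `translate` are hypothetical, and the example uses only the first three blocks of the gene sequence rather than the full record.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code; table 11
# uses the same codon-to-amino-acid assignments).
AMINOS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINOS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove whitespace from a block-formatted sequence and uppercase it."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the gene sequence shown above.
    print(translate("ATGAATCGTT CAAGCTCACT ATTCACATTA"))
```

Applied to the first three blocks of the gene sequence, `translate` yields `MNRSSSLFTL`, which matches the first block of the protein sequence. Passing the full gene sequence to `gc_content` would give a value to compare against the GC content row.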