Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1048 |
Symbol | |
ID | 5732952 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 1196371 |
End bp | 1197825 |
Gene Length | 1455 bp |
Protein Length | 484 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641278183 |
Product | hypothetical protein |
Protein accession | YP_001543824 |
Protein GI | 159897577 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00243617 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGATGGT TTAGAGTGTT GTTTGTGCTA TTGATTGTAG CAGGATTTGG CCTGACATGG TATCTTTTGC AGCCTCAAAA TAGCTTGGTT TGGCAAACAT TCACCCCTAC GGTTATCCCG CTGGAACAGG CTGAGTTGGC CAATCCTGGG CGTGGGCTAT ATCAATGGCG TGGCCAAACC ATGATCTGTC CTAGCGAGTT GATTTCGAAT CGCGAACGCT ATGATCGCTG GACATGGGCC GAACTTGAGC CAAACGAAAA TCAATACGAT TGGCAAGAAA TTCATCAATT ACTGGATATG GCTGAGCAGA ACGGCCAGCG GGTTTGGCTC GGTTTGGGGG CAAGTGCTGG CCCAAGTAAT AATGGCCCGT TTTTACCAAT TTACTTGCAG AAACCTGAAT TTGGCGCAAA CTTTGAAGGC GATTGGTATC CAAATTACAA TCATCCTTTT GTGCAAAACC GCCTCGAAGC CTTGTTGGCA GCCTTTGTGG CCGAATTTGC GGGCGATCAG CGGATTTTGG GCGTGCAGAT GCGCAGTTAT GGCCGTTATG GCGAGGGCTA TTTGCCATGG AACGCCGATA AAAGCCATGG AATGTGGGCC AGCGAAAGTA CGGCCCGCTG GCTGGTTGAT GCTTGGCATA CCCGACTCAG CCCGCATTTT CTGATTTCAA TTCCGCTGAG TAATAATCCA GTGTTTTACT ATGCCATGAC CAAGCAACCC TATTGGAGCA TTACCCGTGA TGCCTTGGGC ATGCCTGAAC AAATGGGTAA TATTGATCAA CTCATTCAAA GTGATATTAC GGTTGATGGC CAAGCAATTG GCCCGTTGGT GGCTGAGCGT TGGAAAGTAG CGCCGATGTT TAGCGAGATG ATCGGTGAAT ATGGTGAGCG CGATTATAGT GGCCAGTTTT TGGCAGCCCA AACCCAAGTG ATTTCATATC ATATTTCGTA TGTGAGCAAC GGCAATTTTG CCCAGCCCTA TCGCGCAAGC CCGTGGGATT TCTGGCGTGA TCCAATTAAT TGTCCTGAGC AAGCCAGCAA TTGGAGTAAT GCCGATATTG AGAATTTTAT GCTGGCGGGC AAATTAGCCG GTTATCGCTA TGCCCCAACC ACGATCAAAC TCGCCATCGA TAACCAGCAG CTCCAGATCG AAAGTAGCTG GCAAAACGCT GGAGTTGCCC CGATATATGA GCGTTGGCCG TTGGTATGGC AATTACGCGA TGCTACTCAG ACTGTGGTTT GGCAAGGCGA ATCAAGCCTT GATTTACGCC AACTTTTGCC AGCCCAAGCC TATGAGCATC GCCAACAATT TGAACAATTC AAGCTGCCAG CGGGTGAGTA TGAATTGCGA TTGGTTGCTC CAGCGATCAA TCGCTATGTA CGGCCTTTGC AATTGGCAAT TGAAGGCCAG CTGGATGATG GGGCATATCG GATTGGAATT CTGAGCATTC GTTAA
|
Protein sequence | MRWFRVLFVL LIVAGFGLTW YLLQPQNSLV WQTFTPTVIP LEQAELANPG RGLYQWRGQT MICPSELISN RERYDRWTWA ELEPNENQYD WQEIHQLLDM AEQNGQRVWL GLGASAGPSN NGPFLPIYLQ KPEFGANFEG DWYPNYNHPF VQNRLEALLA AFVAEFAGDQ RILGVQMRSY GRYGEGYLPW NADKSHGMWA SESTARWLVD AWHTRLSPHF LISIPLSNNP VFYYAMTKQP YWSITRDALG MPEQMGNIDQ LIQSDITVDG QAIGPLVAER WKVAPMFSEM IGEYGERDYS GQFLAAQTQV ISYHISYVSN GNFAQPYRAS PWDFWRDPIN CPEQASNWSN ADIENFMLAG KLAGYRYAPT TIKLAIDNQQ LQIESSWQNA GVAPIYERWP LVWQLRDATQ TVVWQGESSL DLRQLLPAQA YEHRQQFEQF KLPAGEYELR LVAPAINRYV RPLQLAIEGQ LDDGAYRIGI LSIR
|
| |