Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0773 |
Symbol | |
ID | 5732657 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 873880 |
End bp | 875247 |
Gene Length | 1368 bp |
Protein Length | 455 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641277903 |
Product | hypothetical protein |
Protein accession | YP_001543549 |
Protein GI | 159897302 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.000042468 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATCCTC AAACTGCAAA ACAATCGTCA CACTATATCA CGCAGCCAGC CAACCAATGG ATGGATCGGG CAATTCCTTG GCTAATCGTT GGGTGTTTAT TACCACTCTA TTGCTTGATG AGCAGCCCAA CCATTAGTAC AGCCTATGGC AGCAGCGATA GTGGCGAGTT GGCGACGGCA CTCTGGTTTG GCGCAGTGCC GCATCCGCCA GGTATGCCCA CTTATTTGCT GTTTGGGCAA ATTGCGCTCA ACTGGTTTGG CGGCGAGCCA GCTCAGCGTT TAGCGTGGCT GAGCGCCATC GCTACGGCCT TGAGTGCTGG CGTGGTCGCT GCGAGCGTTG GATTAAGCGC CAGCGAGCAA CCAGCCAAAA TCCGTTGGAG CGCCATGTTG GCTGCTGGAT TAGGCGTTGG GCTAAGTCAA CGAATTTGGC AACAAGCGCT GGTAGTCGAA GTTTATGCCT TGGCAAATTT GTTTCAAGCT TTATTGCTCT GGCTCAGTTT GCGCTGGCAA CGTTGGCAAC AAACCAGCAC ATTGGCGGCG CTCGGCTTGG TTTTAGGCTT GGGTTTGAGC GTGCAACTGC CAGTTGCCGC ATGGCTGGTT GGCTTCGGCT GGTTTTGGTT CAAAGCCAAA TGGTCGATTA GCTGGCGGCA AATTGGGCTA TGTTGGCTGA TGGTTGGGCT TGGTCTCGGT TGTTATTTGG TGTTGCCATG GCGTGGCGCG GCGGTTCCTC AAGCAAGTTG GGGCGATTGG CGCAGCGTTG AGGGGGCAAT TGCCCATATC AGCGCCAGCG AATATCGTTA TTTAGTTGGA GCCGTACCAC TGAGCGAGCA AATTCAACGG CTTGTTTTGG CATTCCGCGA TTTGCTAGCC AGCTATTGGT GGATTGTCGG GCCATTGCTA ATCGGCTTTG GCTGGACAAA AATCATGGTT GCTCAACGCT CGATGTTGAT TGGCTGGGCC GGAGTTACGT TGCTTTGGGC CATCAGTTAT GGCGGCGCTG ATGCCCAAGT CTATCTCTTG CCAATCCATT TGTTGGCTGG TTGGTTGCTG GGCATGGGCG TTGTGGGGTT GGCACACTAT AAATCCATAG AACGTTGGAT GTGGCTAGCG CCAGTGTTGA GTTTGATCGC ATTACCCTTC GGTTGGCAAT TTTCGCTACA CAATCAAACC CAAAGCCGCG ATCAAGCGAT CGCCTTATTG CAACAACAAC CGCCCAATGG TCAATTGCTC AGCAACGACG ATCAAACAAC ATTTAGTGCA TGGTATGTGC AAAAGGTCTT GGGGGTACGG CCAGATCTAG AGATTATTGA TCAGCGGTTA CAACAACAGC TGTGGTATCG GCAACGGGAA GGGTTACCAC TTCGCTAG
|
Protein sequence | MNPQTAKQSS HYITQPANQW MDRAIPWLIV GCLLPLYCLM SSPTISTAYG SSDSGELATA LWFGAVPHPP GMPTYLLFGQ IALNWFGGEP AQRLAWLSAI ATALSAGVVA ASVGLSASEQ PAKIRWSAML AAGLGVGLSQ RIWQQALVVE VYALANLFQA LLLWLSLRWQ RWQQTSTLAA LGLVLGLGLS VQLPVAAWLV GFGWFWFKAK WSISWRQIGL CWLMVGLGLG CYLVLPWRGA AVPQASWGDW RSVEGAIAHI SASEYRYLVG AVPLSEQIQR LVLAFRDLLA SYWWIVGPLL IGFGWTKIMV AQRSMLIGWA GVTLLWAISY GGADAQVYLL PIHLLAGWLL GMGVVGLAHY KSIERWMWLA PVLSLIALPF GWQFSLHNQT QSRDQAIALL QQQPPNGQLL SNDDQTTFSA WYVQKVLGVR PDLEIIDQRL QQQLWYRQRE GLPLR
|
| |