Gene Information |
Locus tag | Haur_4095 |
Symbol | |
ID | 5735954 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 5226737 |
End bp | 5228713 |
Gene Length | 1977 bp |
Protein Length | 658 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641281247 |
Product | membrane protein-like protein |
Protein accession | YP_001546855 |
Protein GI | 159900608 |
COG category | [S] Function unknown |
COG ID | [COG5305] Predicted membrane protein |
TIGRFAM ID | |
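The coordinate and length fields above can be cross-checked against each other. The sketch below is illustrative only and assumes that the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. Neither assumption is stated in the record, and the variable names are invented for the example.

```python
# Values copied from the Gene Information fields above.
start_bp = 5226737
end_bp = 5228713
gene_length_bp = 1977
protein_length_aa = 658

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A CDS should be a whole number of codons (3 bases each).
codons, remainder = divmod(span, 3)

# Assumption: the last codon is a stop codon and is not counted as a residue.
residues = codons - 1

print(span == gene_length_bp)         # coordinate span agrees with Gene Length
print(remainder == 0)                 # length is a multiple of three
print(residues == protein_length_aa)  # codon count minus stop agrees with Protein Length
```

Under these assumptions the three fields agree: the span is 1977 bp, which is 659 codons, or 658 residues plus one stop codon.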
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCTAAAC ATCCCACTCG CAGCCTCAGT TTTGGCCTTT TGATCATCGC GCTGGGCTTA CGCTTCTTCC GCCTGAGCAG CCAAAGTTTA TGGCTCGATG AAGGCGGTAC CTGGAGCGAG GCTACTGGCC GTTCGTGGCT CAGTTTATTC GGCGATCTGT TTAGCCCACG CCAAAGCTAC CCACTGTATC ATTTGCTGAT GAAGGCTTGG ACGAGCGTGT TTGGCGATAG CGAGCTGGCC TTGCGTGCGC CTTCGGCGCT TGCTGGGACG CTGGCGGTGG TGTTGCTGTA TCATTTGGCA TGGCGTTTGA GCACACCCAC AGTGGCTTGG GTTGCGGCTG GCTTGTTGGC GATCAATCCG TTTGGTTTGT GGCAAAGCCA AGATGCCAAA GTCTATAGCA TGTTGATGGC GGCGGTGCTT TGGTCGAATC GAGTATTGCT CGATCTGCTT GATCAAACAA AACCGAAGCA GGTTTGGCTG TGGTTTGCCA GTGTGGTGCT CTGCCTAAGT TTGCATCGTT TGGCCTTGTT GCAAGTGCTC GGTCAAGTGT TGGTATTGGC GTTGCATTTC AGGCAAGCAG CCTGGCGCAA ATGGCTTTGG CTTGGGTTTG GCTTGCTCAG TCTTGGTTTT GTGGCAGGAA TTATGTTTGG CTTGCGCCAA GACCCGAATG CACCGCCGAT TGGCCGCACG ATTGGGGCAT TTCAAGCAGC TTGGCTTTTG CTTGGCCGCC TCAGTTTCGA CCGTTCGAGC AGCGATTTAA CTTGGCGTTT GTTGCCTGTG GGTTTAGCAC TCGCCAGCGG TTTATGGCTG ACATGGAGCC ATCCACGGCG AAACATTATT TTAAGCTTGT GGCTCTTGCC CACGCTGCTC TTTTTGGCGG TATTGGCAGC AGGCTCGCCG CTCTACGAGC CACGCTATCT CAGTTTTAGC TTGCCGCTAT TTTGCTTGAT TTTGGCCTTA GGCTTGGCTG GTATTGCTCA ACAACGCTGG CTAAGTGCTG GCAGTATCGT GGCGATTACA GGGTTAAGCG CTTGGTTGGT GTTTCAGCAG CCTTATGGCA TTTGGAGCGG CAACGCGGTC AAGGAAGATT ATCGTGGGGC CGTCCAAAGT TTGGCTGAGC ATGTTGCCCC TGATGATGTG ATCATCGTGC AACCTCCCTA TATTGCAACC TTGTATCAAT ATTATGCCAG TCGTGTTACA CCCGATGCGT TGCCTATTGC GCGTGGGTTT GGGCGCATCG GCGCAATTGG CTATGATCAA GGCGAGTTTG ATAATGATTA TGCTGCGTTA TTGGCAGGCC AACGCCGTGG CTGGCTGTTG ATTGCACCCG AAAATGCCAA AGCGATCGAT CCGCCTAACC CGCAATATCC CCAAGATGAT ATGGGCAAAG TTGGGATTAA TTTTCTGACT GCTGATTTAA ATGATAAATG GCGTTGCACC GATTTACCCT ATTGGGAATT TAATGCATTG CGGATTTTGT GTCAGAGCTT TCCACGGCCA ATGCAAGTCG AAAATTTGGC GCTGGTAGGC ACATGGCCCA TTCCCAAATC GGCGACTGCA ACCTGGGATA ATAGCTTACA ACTTGAGGGC TACCAATTTC AAGCTTGGGC TGATGGCTAT CAGGCAGGTG GCACATTGCC TGTGCAACTG GCTTGGCGCA CCAACAACGT TTTAGTCAAA GATTATCGTT TATTTATTCA TTTAGTGCCA GCGTTGGGTG CACCAGTTGC TGCCCAAGTT GATACGATGC CGCTCAATGG TGGTTTGCCA ACCAGTCGCT GGCAGGCCAA TCAAGCGATT 
CATGATCAAG TTTCTGTGCC CTTGCCCAAA AATCTTGCCA AAGGCCGCTA TTTGGTCGTG CTCGGCTGGT ATGATCCAAC AATTACCGAG GTTGCGGCTC AGCGCTTGCC CGTGACAGCC AGCACCGCCC AGCATAGGGC CAACTACATC GAATTGGGCA CGGTTGAGAT CGAGTAA
|
Protein sequence | MPKHPTRSLS FGLLIIALGL RFFRLSSQSL WLDEGGTWSE ATGRSWLSLF GDLFSPRQSY PLYHLLMKAW TSVFGDSELA LRAPSALAGT LAVVLLYHLA WRLSTPTVAW VAAGLLAINP FGLWQSQDAK VYSMLMAAVL WSNRVLLDLL DQTKPKQVWL WFASVVLCLS LHRLALLQVL GQVLVLALHF RQAAWRKWLW LGFGLLSLGF VAGIMFGLRQ DPNAPPIGRT IGAFQAAWLL LGRLSFDRSS SDLTWRLLPV GLALASGLWL TWSHPRRNII LSLWLLPTLL FLAVLAAGSP LYEPRYLSFS LPLFCLILAL GLAGIAQQRW LSAGSIVAIT GLSAWLVFQQ PYGIWSGNAV KEDYRGAVQS LAEHVAPDDV IIVQPPYIAT LYQYYASRVT PDALPIARGF GRIGAIGYDQ GEFDNDYAAL LAGQRRGWLL IAPENAKAID PPNPQYPQDD MGKVGINFLT ADLNDKWRCT DLPYWEFNAL RILCQSFPRP MQVENLALVG TWPIPKSATA TWDNSLQLEG YQFQAWADGY QAGGTLPVQL AWRTNNVLVK DYRLFIHLVP ALGAPVAAQV DTMPLNGGLP TSRWQANQAI HDQVSVPLPK NLAKGRYLVV LGWYDPTITE VAAQRLPVTA STAQHRANYI ELGTVEIE
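The protein sequence follows from the gene sequence by codon translation. The sketch below is a minimal illustration, not the database's own pipeline. It assumes the codon-to-amino-acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in which codons may act as start codons). The function name `translate` and the two fragment variables are hypothetical, and the fragments are the first 30 and last 15 bases copied from the gene sequence above.

```python
# Codon table in T, C, A, G order for each of the three codon positions.
# Assumption: table 11 uses the same amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1; '*' marks a stop codon.

    Spaces (as used in the record's 10-base blocks) are ignored.
    Any character other than A, C, G, T raises ValueError.
    """
    dna = dna.replace(" ", "").upper()
    protein = []
    # Step through complete codons only; trailing bases are ignored.
    for i in range(0, len(dna) - len(dna) % 3, 3):
        first, second, third = (BASES.index(base) for base in dna[i:i + 3])
        protein.append(AMINO_ACIDS[16 * first + 4 * second + third])
    return "".join(protein)


# Fragments copied from the gene sequence in this record.
first_30_bases = "ATGCCTAAAC ATCCCACTCG CAGCCTCAGT"
last_15_bases = "GTTGAGAT CGAGTAA"

print(translate(first_30_bases))  # MPKHPTRSLS
print(translate(last_15_bases))   # VEIE*
```

The first fragment reproduces the opening ten residues of the protein sequence, and the second reproduces its final four residues followed by the TAA stop codon, which is consistent with the stop codon not being counted in the 658 aa protein length.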