Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3809 |
Symbol | |
ID | 5735673 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 4781061 |
End bp | 4782161 |
Gene Length | 1101 bp |
Protein Length | 366 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641280961 |
Product | hypothetical protein |
Protein accession | YP_001546573 |
Protein GI | 159900326 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.000111673 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTCCGAC GAGCAGCCAT CCATCGCTGG TTATTGGTTT TAGTCGCGCT ATTTGTGGCC ATGCCAGTTG CGGCGCAAGC CCCGACCGAA GGCGCAAAAG CTGGCGCGTG GATTGTCACT CAGGCTCAGG CCGATGGCAG TTTTCCTGGC TTTGGGTTGG GCGAAACCGC CGATGCCGTC TATGCCTTGA AGGCAACTGG CTTGAAAGTT GATCTCAACG TTCAAAGCTT TATCGAAAAA AATGCTAGTG CAATCGCCGC TAAGCCTGGG GTTGCCGCCA AGTTTGTGTT GGCCGAATTG TTGCTGGGCT ACAATCCTCG CGCCGTTGCC GGAACCGATT TGGTCGCGGC TGTAACTGGC AGCTACAAAG CCGATAGTGG TATGTATGGT GGCGATGTCA CGACCCATGC CTTGGCTTTG TTGGCCTTGA ATGCTGCTGG CGCACCAGTC GAAAACAAAG CGATCAACAC CTTGAACTCG GTCCAAATCG CCGATGGATC ATGGTCGTTC AGCGGCGATA CAACTGCTGG CGCTGGCGAT ACCAATACCA CCGCTTTGGT AGTACAAGCC TTGGTTGCGA TTGGTCAAGG TAAGAGCGAA GCTGTTACCA AGGCGCTGAG CTACTTGCAA AGCCAACAAA ATAGTGATGG TGGTTTTCCC TATTCGAAGG CTTCGAGCTA TGGCAGCGCC ACCGATGCCA ACTCAACCGG CTTGGTAATT CAAGCGATTG TGGCAACTGG CGGTAATCCA ACCGCTGCAC CTTGGGCGAC GGCGACTGGC AATCCATTGA GTGCCTTGTT GAGCTTGCAA AATGCCAGCG GTGCGTTCCG CTACGATGCA GCAACGCCCG ATGATAATGC GTTTGCGACA TATCAAGCTA CGCCTGCTTT GTTCTATGTG ACCTATCCTT TGACCGCCTT GGTGACAGCA CCCCAACCAA CTGCTGTGCC AAGCACACCT GTGGCAACCG CTACCCCCAA ACCAAACACC CCAATTACCT TGCCCGACAC TGGCGCACCT GCATTGCCAT TATGGCCAGT GGTGATTGTG TTTGGCTTGG CTTGTATTGT GGCTGGTTTA CGTTTGCGCC GCGTCGCTTA A
|
Protein sequence | MLRRAAIHRW LLVLVALFVA MPVAAQAPTE GAKAGAWIVT QAQADGSFPG FGLGETADAV YALKATGLKV DLNVQSFIEK NASAIAAKPG VAAKFVLAEL LLGYNPRAVA GTDLVAAVTG SYKADSGMYG GDVTTHALAL LALNAAGAPV ENKAINTLNS VQIADGSWSF SGDTTAGAGD TNTTALVVQA LVAIGQGKSE AVTKALSYLQ SQQNSDGGFP YSKASSYGSA TDANSTGLVI QAIVATGGNP TAAPWATATG NPLSALLSLQ NASGAFRYDA ATPDDNAFAT YQATPALFYV TYPLTALVTA PQPTAVPSTP VATATPKPNT PITLPDTGAP ALPLWPVVIV FGLACIVAGL RLRRVA
|
| |