Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3048 |
Symbol | |
ID | 5734920 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 3849483 |
End bp | 3850709 |
Gene Length | 1227 bp |
Protein Length | 408 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641280192 |
Product | hypothetical protein |
Protein accession | YP_001545814 |
Protein GI | 159899567 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.756675 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGACGAC GCATTCGTTG GCTACTGTTA ATCGGTGGAG TTGTTGGCCT CGGTTTGGCG GTTGGAATCG GCTGGTGGCA ACGCGATCAA CCAGCAGTTA AGCTCAATCG AACGCTCGTA CAAGATCAAG CTGGCAATAT TCAACTAATC GATAGTCATA ATCAACAATT AACGTTGACA AATGATGCCT CATCAATCGT GCAATATATT CAGGTTACGC CTGCACCCGA TGGCCAACAC GTAGCCTATA TTCAACTAAC ACCCACAGTG ATCGAGATTC GAGTGCAGGC CTTCGATGGC AGCCCAGCTC GCACGGTTTT CAGCGATTTT AATCTGCGCC CATTTTATCT TTCCTGGTCA CCCAATAGCC AAATGCTGGC CTTTTTAGCA TCAGGCACAA CCATGGAATT ATATGTTGTG CCCGCTGATG GCTCCGAAGT AGCGCATAAA GTGCGTGATG GACAACCATC GTATTTTGCT TGGAAGCCTG ATAGCAGCGC TTTGTTATTG CACACTGGCG GCGGTACTCC GGTTGGCAAC ACCGCAGTCC ACTCAGTTCA ATCCAAAGAT TTAACTTTTT TCAAGGAAAC CGCTGGCGAT TTTCAAGCGC CTGCTTGGAA TGCTGATGGC TCAGCGCGGG TGGTGGTAGT AGCTGATGGC GAAATCAATC AACTGATGCA GATTGATCAG GCCGGGCAAC AGGCCTTGAG CGAACCAACC AGCGAAGGCT TTATGTTTGT GCTTTCGCCC GACCGTGCCA AAGTTGCCTA CCAAACCTTT GGCCTACAAA CCCGCTCAGG TTTGATGATT CAAACGATTG CTACTGGCAA AAGCCAAAGT TTCGAAACCG CCCGTCCCTT AGCATTTTTC TGGTCGCCAG ATGGGCGTTC GGTGGCCTTA TTGGTTGCCG ATGCTCGGCC ACGCGGCCCC AGCGGCGATG CTGGAATTGT CAAAGTCAGT CGCCAAGCCC AAAGCGGCGT GCAGGTGCAT TGGGAAGTAC TAGATGTCGA ATCGGGCCAA GTTAAACGGC TCAAATCGTT TGTACCAAGT GGACCTTTTT TGAATGTATT GCCCTATTTC GACCAATATG CCGCCTCGTT AACCTTCTGG TCGAGCGATA GCCAATATCT GCTCAACAAT AGCAGCGATG GGGTTTGGCA AGTGCATGTT GAAACAGGCG CAGAACAGCA ACTAACCAAG GGCGCATTTG GGGTTGCCGT GCCATAA
|
Protein sequence | MRRRIRWLLL IGGVVGLGLA VGIGWWQRDQ PAVKLNRTLV QDQAGNIQLI DSHNQQLTLT NDASSIVQYI QVTPAPDGQH VAYIQLTPTV IEIRVQAFDG SPARTVFSDF NLRPFYLSWS PNSQMLAFLA SGTTMELYVV PADGSEVAHK VRDGQPSYFA WKPDSSALLL HTGGGTPVGN TAVHSVQSKD LTFFKETAGD FQAPAWNADG SARVVVVADG EINQLMQIDQ AGQQALSEPT SEGFMFVLSP DRAKVAYQTF GLQTRSGLMI QTIATGKSQS FETARPLAFF WSPDGRSVAL LVADARPRGP SGDAGIVKVS RQAQSGVQVH WEVLDVESGQ VKRLKSFVPS GPFLNVLPYF DQYAASLTFW SSDSQYLLNN SSDGVWQVHV ETGAEQQLTK GAFGVAVP
|
| |