Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3760 |
Symbol | |
ID | 5735624 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 4729624 |
End bp | 4730826 |
Gene Length | 1203 bp |
Protein Length | 400 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 641280912 |
Product | hypothetical protein |
Protein accession | YP_001546524 |
Protein GI | 159900277 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAACTC AGAAAAAATT GCTCGCTTCA CGGCCTCAGA TAGCCTTAGT TAAAATGAGC GAACAATCTG TGCAACAACG AATAACCGGC GCTTTAGCCG ATGAAATAGG TGCGCGTCCG GCAACCTCGA CCGCCGAGGC TCGTGCCGCC GCCGTGATCG CCGCTCAGAT GCGTCAAGTT GGCCTCGAAG TTGGCGTGCA AACTTTCTCC GCCGCTGCTG CGCCAACCGC AGGCTTGGGC TTATTGGCAG CGATCGGCTT GTTGGCGCTC GGCTTAGGCT GGTGGTTTCC CTATCCTTCA GTCGCCTTGA TTGCGCTATT ATTGCTGTTG GCAGGCCGCG AATTGCATGG CCCGCCCGTT TTGGCTGGCT TGTTGCGCCA ACGCCCAAGC CAAAATGTGA TTGGCACACG GGCGGCAACC CGTATTCCTC GTGCTCGCAT CGTTTTATTA TGTCACCTTG ATTCGCCGCG CATGCTCTCG CCACGCCGTG CCAGTTGGCT GCGGGTTTGG TTACTAACGA TTCCATTTGG ATTTAGTTTG AGCCTGATTG CGCTTGGGGT AGCGATTTTT CTACCTGCTT GGCATAGTGT GCTGTTAATT CCAGCATTCT TTTTGTTGCT CAGTTTGCTG GTGGTGATTC GGCGTGAGTG GAAGGCCGAT TGGACGGTTG GCGCGGTCGA TGCTGCTGCC GTTGGCACGG CAATCGCGTT GGCAGCCGAT TGGCCGCAAC GTGAAGATGT AGAACTATGG GTCGTGGCGC TGGGCGCAGG GGCCGCCGCT GGTTCGGGTA TTCAAGCCTT ATTGAATACC TATCCCTTCC CCAAAGCCGA AACGTGGTTT GTTAACCTGC CGTGGCTGGG CCGTGGCAAC CTCACAATTG TGGCTGGAGA AGGATTGTGG CGCGAACGAA AGCCCGATCC TCAACTTACC AAAATGTTCC ACGAACTACA ATCCGCCAGC GCCCCACTGA TTCGCTCGGC CTATCGGGGC GAACGCTTGG ATAGCGCTCG GCTTTTGGCT ATGGGCTATC ACGCGGTCAG CGTGGTTGGC TTGAAATCCG ATGGCACGGC AGCGGGCTTC CGTCAACCAG ACGATGAAAC CCGCTTACTT TCCGTGCCAC AGATGGAATT GGCCTTACGG GTGTTGCGGC GGGTGCTCGA CCGCTTTGCT CGCAGTCATA GCAACGAGCC GCAATTACCC TAA
|
Protein sequence | MATQKKLLAS RPQIALVKMS EQSVQQRITG ALADEIGARP ATSTAEARAA AVIAAQMRQV GLEVGVQTFS AAAAPTAGLG LLAAIGLLAL GLGWWFPYPS VALIALLLLL AGRELHGPPV LAGLLRQRPS QNVIGTRAAT RIPRARIVLL CHLDSPRMLS PRRASWLRVW LLTIPFGFSL SLIALGVAIF LPAWHSVLLI PAFFLLLSLL VVIRREWKAD WTVGAVDAAA VGTAIALAAD WPQREDVELW VVALGAGAAA GSGIQALLNT YPFPKAETWF VNLPWLGRGN LTIVAGEGLW RERKPDPQLT KMFHELQSAS APLIRSAYRG ERLDSARLLA MGYHAVSVVG LKSDGTAAGF RQPDDETRLL SVPQMELALR VLRRVLDRFA RSHSNEPQLP
|
| |