Gene Information |
Locus tag | Haur_3059 |
Symbol | |
ID | 5734931 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 3865591 |
End bp | 3866880 |
Gene Length | 1290 bp |
Protein Length | 429 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641280203 |
Product | hypothetical protein |
Protein accession | YP_001545825 |
Protein GI | 159899578 |
COG category | [S] Function unknown |
COG ID | [COG4102] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
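The coordinate and length fields above are related by simple arithmetic, which can be checked with a short sketch. It assumes that Start bp and End bp are 1-based and inclusive, and that the unspliced CDS ends in a stop codon that is not counted in the protein length. The function name `cds_lengths` is hypothetical and is not part of any database API.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, expected protein length in aa).

    Assumptions: 1-based inclusive coordinates, an unspliced CDS,
    and a terminal stop codon that does not encode an amino acid.
    """
    gene_length = end_bp - start_bp + 1
    codons, remainder = divmod(gene_length, 3)
    if remainder != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length, codons - 1


# Values copied from the record for Haur_3059.
gene_length, protein_length = cds_lengths(3865591, 3866880)
print(gene_length, protein_length)
```

Under these assumptions the computed values should agree with the Gene Length and Protein Length fields of the record.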
| |
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATTTAA CCCGTCGTCA ATTTGTAGTT GGCTGTAGCA GCGCGATTGC AGCTATGGCT GGTGGTCGGC TGGGTGGTTT GGCTTTTGCC GAGCCAGGTG ATATTAACCG TGATATTTTT GTGGTGGTGT TTCTGCGTGG TGGTTGCGAT GGCATCGGTA TCGTTTCGCC GCTTGATGAT GCCAATTTTC AAGCCGCCCG TAGCACAATC ACCTTTCCAA GCAGTGGCAC AGGCGCGGGC TTTGAATTAG GTTCATTGAG CAATGTGCCG TTTTGGTTGC ACCCCAAAGC GGCTGCCTTC AAAGAACTCT ACGATAGCCA AGATTTGGCC TTTATTCACG CTAGCGGCTT GACCAACGGC ACCCGCAGCC ACTTCGATGC CATGGATTTT ATGGAACGTG GCACGCCCGA CAATAAATCA ACTAGCACAG GTTGGCTGAC CCGCCACATG GCTGCCACTC GTCCCGATGG GGTTGTGCCA GTTATGTCAA CAGGATCAGC TTTACCTGCT TCGTTGCTTG GCAGCCCGAA TGCCGTCACG ATTTCGAACG TGCAGCGTTA CGCTATGCAA GGCTACTCGA CCTATGGGGC GCAACAACAA GCCTCATTAA ACGAAATTTA TAGCCAAACT GGCAGCTTGC TTGATGGCCC AGCCACCCGT TTGCTTAGCT CAATCGCCGC AGTCAAGGCA CGCAACCCCG CCAATCCCTA CGTGCCAATT ACCACCTATC CTGCTGGGGG CTTATCGGAT TCGCTCAAAG CCATCGCCCA GATGATCAAA CTGGATGTTG GTTTGCAAGT TGCGACGCTT GATTTTGGTG GCTGGGATAC TCATGAATCG CAGGTGCCAA TTTTGGGCAA CCAACTTGAT TTATTGACGC GTTCGCTGCA TGCCTTCTAC AACGACTTGG TTGATTACCA CAGCAAGTTG ACGATTGTGG TGATGAGCGA ATTTGGCCGT CGCTTGAAGG CCAATCGTAG TGCTGGCACC GACCATGGCC ATGGCAATTT GGCGATGGTT TTGGGCGGCA ACGTCAATGG TGGGCGAATT TTCGGGCGCT GGCCAGGCCT CGCCAATGCC CAACTCGACC ATGGCGTTGA TTTGGCGATT ACCACCGACT ATCGCACGAT TTTGAGCGAA ATTGTGGTGC GCCGCTTGCG CAACAATCGT TTAGGCTTGG TTTTCCCACA AATTAGCCAA TATCAACCGC TTGGCTTAGT ACGGGGCACC GATCTAACAA TTGATTGGAC TTCAGGCTTC CGCTCATATT TACCAATGGC CCGCCGCTAG
|
Protein sequence | MDLTRRQFVV GCSSAIAAMA GGRLGGLAFA EPGDINRDIF VVVFLRGGCD GIGIVSPLDD ANFQAARSTI TFPSSGTGAG FELGSLSNVP FWLHPKAAAF KELYDSQDLA FIHASGLTNG TRSHFDAMDF MERGTPDNKS TSTGWLTRHM AATRPDGVVP VMSTGSALPA SLLGSPNAVT ISNVQRYAMQ GYSTYGAQQQ ASLNEIYSQT GSLLDGPATR LLSSIAAVKA RNPANPYVPI TTYPAGGLSD SLKAIAQMIK LDVGLQVATL DFGGWDTHES QVPILGNQLD LLTRSLHAFY NDLVDYHSKL TIVVMSEFGR RLKANRSAGT DHGHGNLAMV LGGNVNGGRI FGRWPGLANA QLDHGVDLAI TTDYRTILSE IVVRRLRNNR LGLVFPQISQ YQPLGLVRGT DLTIDWTSGF RSYLPMARR
|
| |