Gene Information |
Locus tag | Haur_2381 |
Symbol | |
ID | 5734262 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 3033183 |
End bp | 3034628 |
Gene Length | 1446 bp |
Protein Length | 481 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641279522 |
Product | hypothetical protein |
Protein accession | YP_001545149 |
Protein GI | 159898902 |
COG category | [S] Function unknown |
COG ID | [COG1262] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00647578 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTGATA TCACCAATCG TCAGCAGTTT TGTCGGGCAG TCGGGGATGG GTTAAAGTTA GTCCAAAGTC AACGAAGTGG CTCCATGCCA AGTACAGAAG CGTATATAGC CGATTGCTTG AGCATTGGGG TTGATACACT CCGTACATTA CGCTATGCTT CCCGTCAACG CGTGATGGTC GAAGATAAAA CTTTAGCCGC ATTAATTTGG ATTGTCCTGG CCGAAGGTCG GGCAACTCAA CCATGGCTGA TCACAATGCT CAGTGCGACT AGTATTATTC CCCCCGAACC ACTGACAACT ACTTGGCTTG AGAGCTATCT GCAAAGTGGT TTCAAACAAC AACTCGATCA AGCATTGCTG AGCCAGGTTG TCCAAAGCAT TCTTCCTAAC CAAGCTCCAA GTGTCCTCAA TCTCGTTGTT GATACCCAAA TCGATGCATC AGATTTGCCC TTAAATCTTC CAACCCCAGC GCCGAGTATT CAAGTTAAAT CAAAATTTCC CAACAAACCG TACCAGTGGT TGGGCTTATT TGGGCTGATC AGTGTGCTTG GCTGGCCGCT GTTTTCATTA TTTCAAGCAA CTGCGGCATC TCATGATGCT TCAAGCAGCA TGGCCTTAAT TCCTGCTGAT GAATATTTGC AAGGCAGTAG TGATGCTGAT ATTGCTGAAT ATACGCGTTT GTGCCAAGTG CATCAGACAG GCTGCGATAG CTCGTGGTTT GCCGATGAAC AACCACAGCG TTTGATTCAA CTTGATGCCT TTGCCATCGA TCGCTTTGAG GTCAGCAATC GCGATTTTCT GCGTTATAGT GAGGCTAACC CCAGCCTCTT AACCCAAGCT GAAACGCAGG GGGCAGGCTT TGTCTGGAGC GATAGTAATG GTTTCGAGTT GATCAATGGA GCAAATTGGC GGCATCCTCA TGGTCCAAAT TCAGCAATTA CTGAACATTT GGATAAACCA GTGGTGCAAA TAACGCCCAG TGAAGCCCAA GCCTATTGTA TTTGGCAAGG CAAACGCCTG CCAACCGAGG CCGAATGGGA GGTAGCAGCC CGTGGTAAGC ATTATTGGCG CTTTCCTTGG GGCAACGATT GGCAGCCCGC TAAGCTCAAT TTTACCCAAG GCAAGCTTAG CCCAGCCTTG ATGAACGTAG ATAGCCTACC CGAGGGTCAA AGTTTTTATG GGGTGGCGCA TATGCTTGGC AATGCTGCTG AATGGACTGC CGATTGGTAT GATCCGCATG CCTGTCAGTC CAACGATCGG CTTAATCCAC GTGGGCCAGT GATTCCAACT GCCCGGCATA CCCGCCGAGG TGGCTCGGTC GCTAGCATGG CCGGAGTTTT GCATAGCACC TGGCGGATTA GCAACGAGCA AATTAACGAT CAACCAAGCA ATGGCACAGG CTTTCGCTGT GTCCAACATA TCGCACTAGA TCAGAGCTTG CCATGA
|
Protein sequence | MADITNRQQF CRAVGDGLKL VQSQRSGSMP STEAYIADCL SIGVDTLRTL RYASRQRVMV EDKTLAALIW IVLAEGRATQ PWLITMLSAT SIIPPEPLTT TWLESYLQSG FKQQLDQALL SQVVQSILPN QAPSVLNLVV DTQIDASDLP LNLPTPAPSI QVKSKFPNKP YQWLGLFGLI SVLGWPLFSL FQATAASHDA SSSMALIPAD EYLQGSSDAD IAEYTRLCQV HQTGCDSSWF ADEQPQRLIQ LDAFAIDRFE VSNRDFLRYS EANPSLLTQA ETQGAGFVWS DSNGFELING ANWRHPHGPN SAITEHLDKP VVQITPSEAQ AYCIWQGKRL PTEAEWEVAA RGKHYWRFPW GNDWQPAKLN FTQGKLSPAL MNVDSLPEGQ SFYGVAHMLG NAAEWTADWY DPHACQSNDR LNPRGPVIPT ARHTRRGGSV ASMAGVLHST WRISNEQIND QPSNGTGFRC VQHIALDQSL P
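
The length fields in this record can be cross-checked against the coordinates with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon (the gene sequence above ends in TGA). The helper names `gene_length`, `protein_length`, `clean`, and `gc_fraction` are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information table in this record.
START_BP = 3033183
END_BP = 3034628
REPORTED_GENE_LENGTH = 1446      # bp
REPORTED_PROTEIN_LENGTH = 481    # aa


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1  # drop the stop codon


def clean(seq: str) -> str:
    """Remove the spaces that separate the 10-character blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length == REPORTED_GENE_LENGTH)
    print(protein_length(length) == REPORTED_PROTEIN_LENGTH)
```

Passing the full gene sequence string to `gc_fraction` gives a value that can be compared with the reported GC content. Because the gene lies on the minus strand, the replicon coordinates refer to the reverse complement of the sequence shown here; GC content is the same on either strand.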