Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4803 |
Symbol | |
ID | 5736648 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 6127380 |
End bp | 6129251 |
Gene Length | 1872 bp |
Protein Length | 623 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 641281969 |
Product | hypothetical protein |
Protein accession | YP_001547562 |
Protein GI | 159901315 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0267485 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGTTCA AAAGGTTCGT ATGGGTTAGC CTAGGGTTGG CTGGACTCAG CTTGATGCTC TTGGTTGGCA ATACACTCAG TTTACGGCTA TTTGGCCAAA TTCAACAGAT TGATGTTGGC AATTGGGGCG ACCAAAATAA TGTTGCTGGC GGCTTCGAGC AAGAACAAAA CGCCCTCGGT GAAACCTACC GTTGGACTCA AGCCGAGGCT ACGATTCGGC TTCAGGGCTA CGGTTCGGCT AGCTCACGCT TGCTTAGCTT AAAAATAGGT GGCGTACCAA GTAGTTTGCC AGTTACCGCA ACCATGCAGG TTTCGACTAA TTCCGCTAGT GTGGCCTTGC CACTAACCCA AACTGCTCGT CACTATCACC TGTTGATACC GCCAACCTAC CAGCCCGATT GGCAGGTTCG GCTCAATATC CCAACCCAAC AAGTTTTGCC CGACCCCCGA TTTTTGGGGG TGCGGCTTGA TCATGTGCAA ATTCAAACCC CAAGTATTGC TTGGTCAAGT GTTCATTGGC ACTTATTAAT CGTGCAATTG GCAATTATGA GTAGCCTTAT TGGGGTATTG TGGTTTTTAG CTGCTGATTG GCCGACAATT GTTGGTATCA GCGGCATAAC AATCTTGGTG CTGGTAACAA TCACTGGTAA ATTTGTATTG GTGGCATGGG CATGGCAACT CCGTTTATTA ATTGTAGCTG CTGCTACAAC GCTGTTAGTT GGCTGGCTCC GTTCATTAAT TCAGAATCAG ATTAAGCATT TACTAAAACC GTATGAATAT CGGTATTTAA TTATATTTAG TGTTATTAGC TTTTTCATTC CAGTAATGAG TATCCTATTT CCAAATTTTG GCTCACATGA TCGGGTAATT CATGCTGATC GTTTAGGTCA AGTTGCTCAA GGCTCAGCAT TATTATTAGA TAAATTATAT GAATTTCAGG GCCGCGAAAC CATTACCCCA ACCACATTTT ATCTATTAGC ACTACCACTA ACTGTATTTT TTAATAATAA TGGATTAATT ATTGAAGGAT TGTATACATT TTTGCATGCA AGCAGTGGGA TTCTTTTGGC AATAACCTTA TTACGCTGGA AGGTTCGGCC TATACTTGCA CTTGCGGCAA TGATTTTAAT CTCAGCAATG CCAATTCAAA TGACAATTTT ATGGTGGGGT TTTGCCCCTC AAATTGTTGG TCAGTGGTTG ATTCTTGTAT TTTTGGCTGT TTTTAGTTTT CAATCGACTC TTCGTCCAAC CATAATCAGC ATTGGGATAC TCAGTTTGGC TATCTGGATG CATAATGGGG TAGCACTTTT AGCAGGAACC TGGATCGCTA GCTATTGTGC ATTAGGCTAT TGGCGCGATC CTGCCCAACG TAGGCATTAT TGGGCTTGGT TTTTAAACTT AATAGGAATC AGCATTTTTG GGTTGTTGGC AATTTATATT GATCTTTTTA TGACAACAGG TACAACCCAA CAACAGGTAT TGGGCTTGAC TGAATATCTG CCAGCAGTAA TCAATGGTTT ATCTGCAAGC TTTGCGCCAA TTGGAATTAT GTTTGTAGCT ATTTTCGGAT TATTACCATT TCTCCAATTA GAAAAAAACA AAAAAATCTT GCTTATTGCT AGTGGATTAA CATTTCTATT GTTTTTGGCC ATTGATATCG TGTTTGGTGT ACAGGTACGT TATAGCTATT TTATTCTGCC ATTCCTATTG ATGATTGGAA TAATATTTAT CGATCAGCGA TTAAGCATCA TTCCATATGT CGAATCTGTG ATTATAACGC TAACACTGCT TTGTTATGGC TATAGTTTGT ATAGTTGGTA CGATGCAATT ATCTATGGCG TAAAGCCTAG TTTGCTTGGG CTAACCCACT AG
|
Protein sequence | MQFKRFVWVS LGLAGLSLML LVGNTLSLRL FGQIQQIDVG NWGDQNNVAG GFEQEQNALG ETYRWTQAEA TIRLQGYGSA SSRLLSLKIG GVPSSLPVTA TMQVSTNSAS VALPLTQTAR HYHLLIPPTY QPDWQVRLNI PTQQVLPDPR FLGVRLDHVQ IQTPSIAWSS VHWHLLIVQL AIMSSLIGVL WFLAADWPTI VGISGITILV LVTITGKFVL VAWAWQLRLL IVAAATTLLV GWLRSLIQNQ IKHLLKPYEY RYLIIFSVIS FFIPVMSILF PNFGSHDRVI HADRLGQVAQ GSALLLDKLY EFQGRETITP TTFYLLALPL TVFFNNNGLI IEGLYTFLHA SSGILLAITL LRWKVRPILA LAAMILISAM PIQMTILWWG FAPQIVGQWL ILVFLAVFSF QSTLRPTIIS IGILSLAIWM HNGVALLAGT WIASYCALGY WRDPAQRRHY WAWFLNLIGI SIFGLLAIYI DLFMTTGTTQ QQVLGLTEYL PAVINGLSAS FAPIGIMFVA IFGLLPFLQL EKNKKILLIA SGLTFLLFLA IDIVFGVQVR YSYFILPFLL MIGIIFIDQR LSIIPYVESV IITLTLLCYG YSLYSWYDAI IYGVKPSLLG LTH
|
| |