Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4347 |
Symbol | |
ID | 5736207 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 5554541 |
End bp | 5556433 |
Gene Length | 1893 bp |
Protein Length | 630 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641281508 |
Product | hypothetical protein |
Protein accession | YP_001547107 |
Protein GI | 159900860 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.954065 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTAGTTG ATACGATGAC GATCTTCAGC CAGCGTCGCC AGTGGTTGAC GCTGGTTACG TTATTAATTG GGCTTTTAAT TGGTTTGGGC TTTGTATCCA ATCGCCAATG GCAAGTGCCG CTGAATCTGG GCTTGGAAGA GGGCATCAAC AACGATGCTC CATTTGTGGT TGGCTTTAAT GCTGGCGAAC AATTGCCCGA TCGTTCAGCT CGTTACCGTT GGTCAACCCT GAATGCCCAA TTGCGCTTTC CGCATGTGCC GCAACGCCAC TACTGGCTCG ATTTGCCACA ACTTACGGGC AACCCAGCCA GCCTCATCAT GGGAACCAGC CAATTTACCA GTACTGCCGG ACGGGTATTG CATGTGTTGT TGCCTGCCGA TGCCGCTGGT AAAGTGGCAA TTACGGTTGC TCAACCAGTG GTGAGTGATG ATCCACGCGA GCTCGGGGCG GCGTTTAGTG GTGGGCAATT AAGCAGCAGT GGTTGGGCTT GGCCTGGGCT CTATCCGACC TTAGCTTGGT TGTTTTTGCT GACGACACTT GGTTTGAGCA TAATCTGGCT TGGTGGTTCG AGCCTTGAAG TTGGCTTGGC AGTTGGCCTA ACAGGCTTGG GGATTGTGGC GGCAACTTGG TTTGCGCCAT TACGGGCAAG CTATGCCGCA CCAGCGGTTG CCCAAACAAC GCTCTATGGT TTGGCTGCAT TGCTGTTGTT GGGCTGGGCC TTGCCGCCAA TCTTGCAGCG CCTAGGCTTA ACGATCAGCC GCGATGTCTT GCGTTGGCTG ATTTTGGCGA CGGTGCTGGT CTGGTCGCTC AAATTGGGTG GACGCTTGCT GCTCGAGCAT ATGCCAGGCG ATATTGGCTT TCATCGCAAC CGAATTCATG CAACCAACCT TGGCGATTTA TTCCGACCAT CGCGCCATCG CGGCATCGAT TTTCCCTATC CGCCAGTGTT GTATGCCTTG TTGCAGCCAC TGACCTTGAC TGGAATTTCA GCCGATTGGT TGTTGCAATT AACCGCCGCA GCCTGTGAAG CCCTGGCGAT TCCGGTGCTA TTTTGTTTGG GCTTGCGCAC AACTGGCTCA AGCCGTGGAG CGCTGGTTGG CGCGATTATG TATGGGCTTG TGCCAGCAGG CTTTATGACC AACGCATGGT CGTTTGATTC GCACATTTTT AGCCAATTTG TGGCCTTGCT ACTGGCAACA TTTATGGTTT GGACGTGGCA ACACTGGCAT GAACGACGTA ACTGGCTTTG GATAACTTTG GGTTTGAGCA CGATTGCGCT CGGACACTTT GGCTTTTATC TTAATACTGG CTTGATGGGC GGCTTGCTGA TGCTATGGCT GTGGTGGCGC GGCCCACGTT CGCAAGGCTG GGCCTTATTC ACCAGCTTGG TAGCAACCCA AGTGATTGTT TGGGCTTTGT ATTACTCAAG TTTTATTGGG CTATTTTTGC AGCAAGGCCA ATCGTTTGCC GAAGGCGGCA TGAACGCAGT CAATCAGCGC GAAGCTGTAC CACGCCTGCA ACTCCTGTGG GATATGATTG ATCTTGGGTT TTGGCGGCAT TATGGTTTAC TGCCTGTCTT AATCGCGCCA TTTGGTTGGT GGCTCAGTCG CAAACATCGC GGCTTGCAAT TGGTCATGGG GGCGACGTTC GTCGTTAGTT TGATCTTGGC GGCCTTTCCA ATTATCAATG GATCGACCAT CACCACCCGT TGGCTAATGT TTAGCGCTTG GGCGATTGCC TTGGCCACTG GCATTGCCCT CGATTGGTTG TGGCAACGCA CGCGTTGGGG TCGCTGGCCT GCGATTCTGA TTACTAGTGG TTGTGCGATT TTTGGCATGA TCGTCTGGTT TGCTGCGATG GTCTATAAAA TTCGCCCACC GGAACCGTTT TAA
|
Protein sequence | MVVDTMTIFS QRRQWLTLVT LLIGLLIGLG FVSNRQWQVP LNLGLEEGIN NDAPFVVGFN AGEQLPDRSA RYRWSTLNAQ LRFPHVPQRH YWLDLPQLTG NPASLIMGTS QFTSTAGRVL HVLLPADAAG KVAITVAQPV VSDDPRELGA AFSGGQLSSS GWAWPGLYPT LAWLFLLTTL GLSIIWLGGS SLEVGLAVGL TGLGIVAATW FAPLRASYAA PAVAQTTLYG LAALLLLGWA LPPILQRLGL TISRDVLRWL ILATVLVWSL KLGGRLLLEH MPGDIGFHRN RIHATNLGDL FRPSRHRGID FPYPPVLYAL LQPLTLTGIS ADWLLQLTAA ACEALAIPVL FCLGLRTTGS SRGALVGAIM YGLVPAGFMT NAWSFDSHIF SQFVALLLAT FMVWTWQHWH ERRNWLWITL GLSTIALGHF GFYLNTGLMG GLLMLWLWWR GPRSQGWALF TSLVATQVIV WALYYSSFIG LFLQQGQSFA EGGMNAVNQR EAVPRLQLLW DMIDLGFWRH YGLLPVLIAP FGWWLSRKHR GLQLVMGATF VVSLILAAFP IINGSTITTR WLMFSAWAIA LATGIALDWL WQRTRWGRWP AILITSGCAI FGMIVWFAAM VYKIRPPEPF
|
| |