Gene Information |
Locus tag | Haur_0806 |
Symbol | |
ID | 5732706 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 910562 |
End bp | 912013 |
Gene Length | 1452 bp |
Protein Length | 483 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641277937 |
Product | hypothetical protein |
Protein accession | YP_001543582 |
Protein GI | 159897335 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.000470331 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTGCGAC GTTTTCGGCG GCGACGCTTA GCCTTGTGGG CAGTGATTGG GGCAGTTTGT GGCTTGATTG TAATTTTACG AATGCCACTG CATCAACTGC CACTGGAGCG CGACGAGGCT GCCTATGCCG TCATTGCCCG CGATTGGGCT GATGGGGCAA TTCCCTACCG CGATCGATTT GATCATAAAC CACCGTTTGT TTATGCTGCC TATGCTCCGC CAGTTTTGTT AGGCACTGAG CCAATTCAAA CAATTCGGGT TTGGGCGACG CTCTGGCTGG AAGCCACAAC TTTAGTGATG TGGCGAGCAG GTCGGCGTTT GTGGCGCTCC GAAGCGGCGG GCGTGCTCAC CGCAATTTTA GTCGCCAGCT GGAATAGCGC AGTTTTCTTG CAGGGCGTGA CCTTCAACAG CGAAGCAATT ATGCTCTTGC CCAGTTGCTT GGCCTTGCTT TTTGGCCTCA AAGCGCTTGA TAGCCAACAA AAAGCTTGGT GGATGTTGGC GGGCGTGGCA GCAGCCTTAG CATTATTATC CAAACCTGTG GCTTTGTTAA TTTTGCCAGC GCTAATTATT ACCACGATTG CCTTCAAAGG CGATGTCAAC GAACGGCTTG GCCGCGCAAT TTTGGTGATC GACGGCTTGG CGCTGGTGTT GATTCCAACT TTGCTCTATT TTGCCTTAGC TGGTGGTTGG AGCCAATTTG TTGAAGCACT TTGGACGTAT AACACGGCTT ACCTCGACCA AACCGATGCC ACAATCAGCC AGCTTTGGCC GATTTGGCGC TCGCTCCTGT TGCTGCTAGT GGCGGGGATT GTGGGCGCAA CCATGGCTTG GCGACGGCGG CGCTACGGGC CGAGTTTAAC TCTGGCTGGC TTATGGAGCA TCGGCTTGAT TGCCAGCGCC TTTGTAAGTT TACGCGCTTA TCCGCATTAT TATCAGGCGC TTGTGCCCGC TTTGGCTGTC TTTGCAGGCG GCTTGGCCAA AGTTCGGCTG GGATTTTTGC AACAATACAA AGATTTTTCG ATTGTTGTGA TCGCGTTTTT GCTAGCGATT CAGCCATTAG CCAGCCTCTG GCCGTTGTAT AATAAAACAC CCGAAGCCCA AATTGAACAA CTCTATGGCG TTGATGGCCG TGAATTCTTT GCGCTCGCGC CCAAAGTTGT CGAATGGATC GATAGCAATG TGGGCCAGCA GGCCAGCGTC TGGGTTTGGG CGGCTGAGCC AGAAATTTAT TTGTATGGCG ATTACGCTGT GCCAAGCCGA TTTCCCTACG ATTACCCCTT AGCGATTTTG CCCAATGCCC TCGACCAGAC CTTGCAGCAA CTGCGCAGCC AGCCACCAAA GGTCATTGTG ACCTATGGAG CCGTGCGCCC AATCGGCTTT GATGCTATGT TGCAAGTTGC GCCCTATCGT TTACGAGTGC ATTTGGCTGG TTATGATATT TGGGTGCGTT AG
|
Protein sequence | MLRRFRRRRL ALWAVIGAVC GLIVILRMPL HQLPLERDEA AYAVIARDWA DGAIPYRDRF DHKPPFVYAA YAPPVLLGTE PIQTIRVWAT LWLEATTLVM WRAGRRLWRS EAAGVLTAIL VASWNSAVFL QGVTFNSEAI MLLPSCLALL FGLKALDSQQ KAWWMLAGVA AALALLSKPV ALLILPALII TTIAFKGDVN ERLGRAILVI DGLALVLIPT LLYFALAGGW SQFVEALWTY NTAYLDQTDA TISQLWPIWR SLLLLLVAGI VGATMAWRRR RYGPSLTLAG LWSIGLIASA FVSLRAYPHY YQALVPALAV FAGGLAKVRL GFLQQYKDFS IVVIAFLLAI QPLASLWPLY NKTPEAQIEQ LYGVDGREFF ALAPKVVEWI DSNVGQQASV WVWAAEPEIY LYGDYAVPSR FPYDYPLAIL PNALDQTLQQ LRSQPPKVIV TYGAVRPIGF DAMLQVAPYR LRVHLAGYDI WVR
|
| |
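The coordinate and length fields in this record are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS (TAG here) is a stop codon that is not counted in the protein length. Neither convention is stated in the record itself. The function names `gene_length_bp` and `protein_length_aa` are hypothetical helpers written for this illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates.

    The strand does not affect the length, only the reading direction.
    """
    return abs(end_bp - start_bp) + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Protein length for an unspliced CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


# Values taken from the record for Haur_0806.
gene_len = gene_length_bp(910562, 912013)
prot_len = protein_length_aa(gene_len)
print(gene_len, prot_len)  # 1452 483
```

Both results agree with the Gene Length (1452 bp) and Protein Length (483 aa) fields above.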