Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4010 |
Symbol | |
ID | 5735871 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 5116857 |
End bp | 5118188 |
Gene Length | 1332 bp |
Protein Length | 443 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 641281160 |
Product | hypothetical protein |
Protein accession | YP_001546770 |
Protein GI | 159900523 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGCTACA ATCAATGGAT CGATCAGCAA GTTGATAGTG ATTTTCAAAA AGCGCGATTT AGCGCGTTTT TGAATGCAAT AAAAGCACGT TTGCGCGACG AACCGCAAAC GCTCTTACCA TTTGAAGAGG TTCGTGCTCG CCTCAATATT CGCAGCCAAA CCGATCGGGG CATGCAAATT GTGCCGTTGC AATCGATTAT TGGCAGCGAA GGCCGATATA GCGATTTCGA CCGCCAATTT TTGCCACGCC ACGAGGTAAC TAAAGCCCGT TGGAAAAACG TTGATCGGGC ACATTATAAC GATGTGATGC TCCCGCCAAT CGAACTCTAT CAAATTGGCG AGGTCTATTT TGTGCGCGAT GGCAATCATC GAGTGTCAGT CGCTCGCCAA CAAGGCCAAG ATTTTATCGA TGCCCATGTG ATCGAATTGC TCAGTGATGT GCCAATCAAG CCAACCATGA CCCAAGATGA ATTGAATCAA TTGGAAGAAC GCTCAGACTT TTTGGAATGG ACGAATTTGG CGCAGTTGCG GCCTGAAGCC CAGTTTATCG AGCTAACCAC ACCTGGTGGC TATTTGGATC TGATTCGGCA TATCAACGGC CATCGCTATT TCAAATCGTT GGAACTTGGC GAGGAGCCAA CCAGCGAAGA GGCGATTTTA AGTTGGTACG ACAATATTTA TAAACCTTTG ATCGACGAAA TTTATGATAC TGGGATTTTG GCGGCCTTCC CCGAGCGCAC CGCCACCGAC TTGTATCTGT GGATTATGGA TCATCGTCAT TATCTGACCC AGACTGAGGG GATCGATCCA GGCCCGAAAA CTGCGGCGAT CGACTATACC CGCCATTTTG GCGAGCGCAA AAATCGGGTT AAGTTACCTG ATCCGCCCAG CCATGCCGAA CTAGAGTTTA TTCAGTGGAG CAAACTCAAT CTGCTGCGCC AAGATGTGCG TGTGCCGCTC AGCAACGATA ACGATTATGC GCGGATCAAA TTGCATGTGA TCGATCATCA ATATTTTATG GGCAAGGATC TTGATCGCGA AGTCACGTTC GAGGAAGCTG TACAAAGCTG GTACGATACG GTGTATCGCC CAGTAACCCA AGCCATCGCT CAACAACATA TTGGTGAGAT GTTTCCACGC CATACAATCG GCGATTTATA TTTGTTGATC ACCGATTATT TGCATATGCT ACGCGGTCAG GGCGTGGAGA TTCAGCCATT ACAAGCGGCC CGCGAATATG CCGAACGCTT TGGTAGCGAA CGTGGAGCCT TTCTGACAGG CGTGTTGCAT CGAGCACGGC GATTGATGAA ACGAGCCTTT GCAACAACAT AG
|
Protein sequence | MGYNQWIDQQ VDSDFQKARF SAFLNAIKAR LRDEPQTLLP FEEVRARLNI RSQTDRGMQI VPLQSIIGSE GRYSDFDRQF LPRHEVTKAR WKNVDRAHYN DVMLPPIELY QIGEVYFVRD GNHRVSVARQ QGQDFIDAHV IELLSDVPIK PTMTQDELNQ LEERSDFLEW TNLAQLRPEA QFIELTTPGG YLDLIRHING HRYFKSLELG EEPTSEEAIL SWYDNIYKPL IDEIYDTGIL AAFPERTATD LYLWIMDHRH YLTQTEGIDP GPKTAAIDYT RHFGERKNRV KLPDPPSHAE LEFIQWSKLN LLRQDVRVPL SNDNDYARIK LHVIDHQYFM GKDLDREVTF EEAVQSWYDT VYRPVTQAIA QQHIGEMFPR HTIGDLYLLI TDYLHMLRGQ GVEIQPLQAA REYAERFGSE RGAFLTGVLH RARRLMKRAF ATT
|
| |