Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3817 |
Symbol | |
ID | 5735681 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 4791123 |
End bp | 4792013 |
Gene Length | 891 bp |
Protein Length | 296 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641280969 |
Product | hypothetical protein |
Protein accession | YP_001546581 |
Protein GI | 159900334 |
COG category | [R] General function prediction only |
COG ID | [COG5006] Predicted permease, DMT superfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0000393324 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGGCAG TGCAAATCTC AACCAATCAG AATCGGGTCG AAATACCATG GCGAGCCTAT GTGGCGCTGG CAATTGGCAT TGTGTGTATC GCATTATCGG CAATTTGGGT CAAGTGGGCG GCTGTGCCTG GGCCAGTTTC GGCCTTTTAT CGCATGCTGA TTCCAGCGGC GATTTTGCTG CCATGGTGGT TTGCCAAACG GCCTAAGCAT CTGCCCAAGC AAGCTACGCG GCTTTCGCTG CTTGGCGGCG TTTTCTTTGC CTTCGATTTA GCCCTGTGGA ATAGCGCAAT TTTGCTGACT TCAGCTGCCA ACTCAACCCT GTTTGCCAAT AATGCCCCGC TCTGGGTTGG CTTGGGTGCT TGGTTGATTT TTCGCGAACG CTTGCCGCAA CGCTTTTGGT GGGGCATGGC GATTGCCCTG ATTGGCGTGG TGGTAATCAT GGGCGAGAAT TTGCAACAAC TTACCATGAG CCAAGGCGAT TTGTTGGCGA TCTCGGCAGG TGGTTTTTAT GCGGCCTATT TGCTCACGAC CCAACGCGCC CGCGCTGAAC TGGATACCTT GACCTTTATG ACTTTGGGAA TCGTGGTCAG CGTGGTGATT TTGGGCTTGA TGTGTTTGAT TGGCGGCTAT AGCATCATTG GGTTTAGCCC CCAAACATGG TGGTCGTTGA TGGGTTTGGG CTTGGTTTCA CACCTCGGCG GTTGGCTGGC GATCAACTAT GCACTTGGGC ATATCAAAGC TGCGACGGCT TCAGTCAGTT TGTTGGGCCA GCCAGTCTTA ACAGCCTTGA TCTCAATTCC GCTCTTGAAT GAATCATTAA ATATCTTTCA AATTATTGGC GGTAGCTTGG TGATTGGCGG AATTTGGCTG GTTAACACCC AAAAAGCGTA A
|
Protein sequence | MQAVQISTNQ NRVEIPWRAY VALAIGIVCI ALSAIWVKWA AVPGPVSAFY RMLIPAAILL PWWFAKRPKH LPKQATRLSL LGGVFFAFDL ALWNSAILLT SAANSTLFAN NAPLWVGLGA WLIFRERLPQ RFWWGMAIAL IGVVVIMGEN LQQLTMSQGD LLAISAGGFY AAYLLTTQRA RAELDTLTFM TLGIVVSVVI LGLMCLIGGY SIIGFSPQTW WSLMGLGLVS HLGGWLAINY ALGHIKAATA SVSLLGQPVL TALISIPLLN ESLNIFQIIG GSLVIGGIWL VNTQKA
|
| |