Gene Information |
Locus tag | Haur_3928 |
Symbol | |
ID | 5735789 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 4923190 |
End bp | 4924089 |
Gene Length | 900 bp |
Protein Length | 299 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641281079 |
Product | ApbE family lipoprotein |
Protein accession | YP_001546690 |
Protein GI | 159900443 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG1477] Membrane-associated lipoprotein involved in thiamine biosynthesis |
TIGRFAM ID | |
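The coordinate and length fields above are tied together by simple arithmetic. A minimal Python sketch of that consistency check, assuming 1-based inclusive coordinates (which the values in this record agree with) and a protein length that excludes the stop codon. The function names are illustrative and not part of any database API:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases, ignoring whitespace in the sequence."""
    bases = "".join(seq.split()).upper()
    return 100.0 * sum(b in "GC" for b in bases) / len(bases)


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(4923190, 4924089)
print(length, protein_length(length))  # 900 299
```

`gc_percent` can be applied to the Gene sequence field to compare against the reported GC content.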
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGCGCGAAT TGATCTTTCG GGCAATGGGT TGTGAAGTGG CAGTGCTTAG CGATCAGTCA ATCGATTTTG AGGCAGTGCC ACAATGGTTT GAGGCTTGGG AGCAAACGCT AAGCCGTTTT CGCCCAAATA GCGAATTGAT GCGGCTCAAT CAGCGGGCTG GTCGGCCACT GGCAGTCAGC GAAACCCTGT GGGAAGTGCT AGGTTTGGCG CTCGATGCTG CCGCTTGGAG CAAAGGCTTA GTTGTGCCAA CTCTGCTGCC CGCGCTTGTG CACGCTGGCT ACGATCGCCC ATTTGAATTG TTGAGCAACG CGGTCAACCA ACCACAGGCT TGGCACTATC AGCCACAAGC ATGGCTGGCA ATTTCTCGCC GCGATCAAGG TCGTTTGGTG CAACTAGCCC GTGATACCGC CCTTGATTTG GGTGGTATTG CCAAGGGTTG GGCCGCCGAT CGAACTGCTC AGCAATTGGC AACTCAAGCG GCCTGCTTGG TTGATGTCGG TGGCGATTTG GCAACCGCAG GCCAGCGCCA AGATGGTTTG GCTTGGGCGA TCGGTGTGCC AAACCCACGC ACCCAGCAGC GTGAAGCCAT GATCTGGTTT GCAACTGGTG GTGTAGCAAC CTCAGGTGTT GATTATCGGC GCTGGCGCAA AGGCGATCGC TGGCAACATC ACCTCATCGA CCCACGCACT GGCGAACCAA GCACGAGCAA TGTTTTGAGT GCCACGGTGA TTGCGCCCAA CGCAGTGCAA GCCGAAGTTG CCGCCAAAAT TGTGGTGTTG CTGGGCATCG AACATGGCTT GGCTTGGATT AATCAGCATC CTGCTTTTGC GGCACTGGCC TTTGATCACG CTGGCAAAAG CTATAGCAGC GAACGTTTTA ATAGTTATCG TTGGGAGTAA
Protein sequence | MRELIFRAMG CEVAVLSDQS IDFEAVPQWF EAWEQTLSRF RPNSELMRLN QRAGRPLAVS ETLWEVLGLA LDAAAWSKGL VVPTLLPALV HAGYDRPFEL LSNAVNQPQA WHYQPQAWLA ISRRDQGRLV QLARDTALDL GGIAKGWAAD RTAQQLATQA ACLVDVGGDL ATAGQRQDGL AWAIGVPNPR TQQREAMIWF ATGGVATSGV DYRRWRKGDR WQHHLIDPRT GEPSTSNVLS ATVIAPNAVQ AEVAAKIVVL LGIEHGLAWI NQHPAFAALA FDHAGKSYSS ERFNSYRWE
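The protein sequence corresponds to the gene sequence read in triplets under translation table 11. A minimal Python sketch, assuming the gene sequence is already given in coding orientation (it begins with ATG and ends with TAA, although the gene lies on the minus strand) and using the codon assignments that table 11 shares with the standard code. `translate` is an illustrative helper, not a database function:

```python
from itertools import product

# Codon assignments in NCBI order (first base varies slowest, bases T, C, A, G).
# Table 11 uses the same assignments as the standard code; it differs only in
# which codons may act as alternative starts, which is not modelled here
# because this gene starts with ATG.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(seq: str) -> str:
    """Translate a coding-strand DNA sequence, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the Gene sequence field.
print(translate("ATGCGCGAAT TGATCTTTCG GGCAATGGGT"))  # MRELIFRAMG
```

Passing the full Gene sequence field to `translate` allows a direct comparison with the Protein sequence field once its spaces are removed.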