Gene Information |
Locus tag | Hhal_1966 |
Symbol | |
ID | 4710461 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | + |
Start bp | 2162907 |
End bp | 2164172 |
Gene Length | 1266 bp |
Protein Length | 421 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 639856439 |
Product | protein of unknown function DUF395, YeeE/YedE |
Protein accession | YP_001003532 |
Protein GI | 121998745 |
COG category | [R] General function prediction only |
COG ID | [COG2391] Predicted transporter component |
TIGRFAM ID | |
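The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. Both assumptions are consistent with the values in this record but are not stated by it, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record for Hhal_1966.
length = gene_length(2162907, 2164172)
print(length)                           # 1266
print(expected_protein_length(length))  # 421
```

Both results agree with the Gene Length (1266 bp) and Protein Length (421 aa) rows of the record.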
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.629772 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTCTTCG ATTCCTTTGT CGCAGCGCAC TGGACGATGC TCGGCACCGT CTTCGCCATC GCGGTGTTGC TCGGTGCCGT CGTCAACAAG AGCAACTTCT GCACGATGGG CGCCGTCTCC GACATCGTGA ACATGCAGGA CTGGCAGCGG ATGCGCATGT GGATCCTGAT CATCGCCGTG GCGATCCTCG GCGTAGGGCT GCTCGAGCCC CTGGGGCTGA TCAACGCCGA CGAGAGCATG CCGCCGTACC GCGCCTCCGA TTTCGCCTGG GCCGGCTACC TCCTCGGCGG CCTGCTTTTC GGCATCGGCA TGACCCTGGG CAGCGGATGC GGCAACAAAA CGGTGGTGCG CATCGGCACC GGCAACATCA AGTCGCTGTT CGTCGCGGCG GTGCTCGGCA CGGTCGCCTT CTTCATGACC AACCCCCTGC CGCTGATCGA CGCCTCCCTG CGCGATCTGT TCTTCGGCTG GGTCAACGCC ACCGCCATCT CCCACAGCCA CGGCCAGGAT CTGGGCAGCC TGATCGCCGG CGAGGCGGGG CCGTGGGTGC GCCCCCTCCT GGCCCTGCTC ATCGGCGGTG CCCTGCTCTA CGCCGTTCTG CGGGTCGCCG GCTTCCGCCA GGATCGCAAC GCCGTCTCTG GGGCACTGAT CATCGGCGCC TGCATCGTTG CGGTGTGGAC GGTGACCAGC AACGTGTACG TGGCCGACGA GACGGGTCAA CGCGACACCC TCCAAACCTA CGCCACGGAC TGGGACTTTC ACCACCCGGA CACCGATGCG GGCCGCCCCG AAAGCACCCG CTGGCTGGCA CCGCAGGGGG TCAATTTCGT CGGCCCGCTG GTACAGAGCA CCCAGTACAC CGCCAGCGGC TTCAATCCGG GGCTGATCAC CGTCGGTGTC ATGGTGATCG GCGGCGTGAT CGTCGGCTCA TTCCTCTGGG CCCTGATCAG CCGCAGCTTC CGCTTCGAGT GGTTCGCCGA CCGACAAGAC TTCAACCGAC ACCTCACCGG GGGTGTCCTC ATGGGGATCG GCGGCCCGCT GGCCATGGGC TGCACCTTCG GCCAGGGTAT CACCGGCATG TCCACGCTGG CCCTGAGCGC ACCGCTGGCC CTGGGCGGGC TGATCCTCGG CAGCGCCCTG ACCATGAAGA TCCAGTACTA CAAGCTCCTC TACGAAGACG AGGCCACCTT TAGCAAGGCC CTGGTCACCG GCCTGGTGGA CCTTCGCCTG CTTCCGGCGT CGCTGAGGCA GCTCGATGCG CTTTGA
|
Protein sequence | MVFDSFVAAH WTMLGTVFAI AVLLGAVVNK SNFCTMGAVS DIVNMQDWQR MRMWILIIAV AILGVGLLEP LGLINADESM PPYRASDFAW AGYLLGGLLF GIGMTLGSGC GNKTVVRIGT GNIKSLFVAA VLGTVAFFMT NPLPLIDASL RDLFFGWVNA TAISHSHGQD LGSLIAGEAG PWVRPLLALL IGGALLYAVL RVAGFRQDRN AVSGALIIGA CIVAVWTVTS NVYVADETGQ RDTLQTYATD WDFHHPDTDA GRPESTRWLA PQGVNFVGPL VQSTQYTASG FNPGLITVGV MVIGGVIVGS FLWALISRSF RFEWFADRQD FNRHLTGGVL MGIGGPLAMG CTFGQGITGM STLALSAPLA LGGLILGSAL TMKIQYYKLL YEDEATFSKA LVTGLVDLRL LPASLRQLDA L
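The sequences above are printed in space-separated blocks of 10 characters. As a rough sketch of how such a field might be processed, the code below joins the blocks, computes the GC fraction, and translates codons. It assumes that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons, which this sketch does not model). The function names are hypothetical, and only the first three blocks of the gene sequence are used as input.

```python
from itertools import product

# Codon -> amino acid map in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def join_blocks(field: str) -> str:
    """Remove the whitespace between the 10-character blocks."""
    return "".join(field.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
excerpt = join_blocks("ATGGTCTTCG ATTCCTTTGT CGCAGCGCAC")
print(translate(excerpt))  # MVFDSFVAAH
```

The translated excerpt matches the first block of the protein sequence. Passing the full Gene sequence field to `gc_fraction` should give a value comparable to the GC content row, although that result is not shown here.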
|
| |