Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hhal_1899 |
Symbol | |
ID | 4710677 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | - |
Start bp | 2088879 |
End bp | 2091170 |
Gene Length | 2292 bp |
Protein Length | 763 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 639856372 |
Product | Na+/solute symporter |
Protein accession | YP_001003465 |
Protein GI | 121998678 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG0591] Na+/proline symporter |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCAGCTA TGGCCATATG GCTGTTGGTG TTTGTAGGCC TGTACTGGGG CTACTGCATC TTCTGGGGTA TCAAGGGATA CCTTTGGTCG CGCACAGCCA GCGACTACTT CGTGGCCGGG CGCTCCGTCA GTATGTGGGT CTTCATACTG GCGGCAACGG CCACATCGTT CTCAGGGTGG ACGTTCGTAG GACACCCGGG CCTGATCTAC GAGGACGGCC TGCAGTACGC CTACGCCTCG TTCTACTCCA TCTGCATCCC GTTCACCGGC ATGCTGTTCC TCAAGCGCCA ATGGATGATT GGCAAGCGTT GGGGTTACGT CACGCCGGGC GAGATGTTCG CGGACTACTT CCGGACGGAC TCGATCCGCA TCCTGATCCT CATTGTGGCG CTGATCTTCG CAGTGCCGTA CCTGGGGATT CAGCTGCGCG CCTCGGGCTT CCTGTTCCAC GTGCTCACCG ACGGCTGGAT GGGCGTCGAG ATCGGCATGT GGCTGCTGTC CGCGGTGGTG CTGTTCTACG TGGCCTCGGG CGGTCTGCGC GCCGTGGCGT ACGTGGACGC GGCGCAGGCT GTTTTGCTCA TTGTGGGCAT CTTCGTCATC GGCGTCGTGA CGCTCTACTA CATCGGCGGC TGGAACAACT TCACGCACGT GATTGCCGGC CTGGTCGAGT GGGAGACCGC CACCGGTGGT GCGGGTGAGG TCTCGCCGGG CATGGTGCCG GGGGATAGCG GTGCCAGTGG CTATGTCGCC ATCCCGGGTG CCATCCAGTG GGTCGGTGCC GCGGCCAACG CCCAGGGCGG TGTCTGGACC GGCGTCATGA ACATCACCTA CATGCTGGCG CTGGGCGGCA TCATGGCCTC CCCGTCGTTC ACCATGTGGG CGTTCTCGAA CCAGAACCCG CGTCCGTTTG CGCCGCAACA GACCTGGATG TCCCCGGTGG GCGTCGGTGC CCTGATGTTC ACCTTCCTGG CGATCCAGGC GATGGCCACC CACGGCCTGG GGGCCAACAC CGAGTTCGCC AAGGACATCT TCTCCGACGA GCACGGTGAG CAGCTGGCCG AGTACCGGAC GCTGTTCGAG GCCTCCGAGG AGCACCAAGG GCTGGTGGAC GAGGTGCGCG AGCGCCTGGA CGCCGGCGAG TCCCTGGACG GCATGGATCT CAGCCCGCTG GTACCCACCG CGGTCATGCA GCGCGGCATG ATCCAGGCCG AGCATCCGGA ACTCAGTCGT CAGGAGGTCG AGCAGGTCAT CGCGGCCGGT CTGGCGGCGC TGTCCATCGG TGAGGATCCG CGCAACATGG ATCCGGACTG GCTGGCAGCT CTGCCGGGCG ATCTCGAGCG GGCCTGGCTG GACCTGTCCC TGGATCGGGG CGGTGACAGC GAGCTTGTGC CCCAGCTGCT GAACATGCTG GAGGCGGCGG CGCCGTGGCT GGCGGCCCTG CTGGCGGTCT GTGCCCTGGC GGCCATGCAG TCCACCGGCG CGGCGTACAT GTCCACCACC AGTGGCATGT TTACCCGTGA CCTGCTGCGT CGCTACATCA TGCCCAGTGC CAGCAACCAG GCCCAGGTGG TGGCGGGCCG GGTCTTCGTC ACCATCCTGG TGCTTGCGGC CCTGACCGTG GCGACGGTGA CCACCGACGC CCTGGTGCTG CTCGGCGGTC TGGCCACCGC CATGGGTACG CAGATGTGGG TGCCGCTGGC CGCCATCTGC TTCTTCCCCT GGCTTACCCG TCCGGGTGTG GTCTGGGGGC TGGGCGTTGG CATCGTTGCC GTGCTGATGA CCGAGAACAT CGGCATCGAC CTGCTGGCGG CGGCGGGCGT CGACGTGCCC TGGGGCCGTT GGCCGCTGAC CATCCACTCT GCCGGGTGGG GCCTGGTGCT TAACGCCCTG GTGGCCGTGG TCGTCTCGGC GATGACGCAG AACAACAAGG AAGACTACGA TCACCGCATG ACGGTGCACG CCTTCCTTCG CGAGCATGCG TCACTGCCGG CCGAGAAGCG CCACCTGATC CCGATCGCCT TCACCATCGT CATCGGCTGG TGGATCTTCG CCTTCGGTCC GGGCGCCCTG CTGGGCAACT GGGTCTTCGG TGATCCGACC AACCCCGATA CCTGGTGGTT CCTCGGCCTG CCGTCCATCG TGGTCTGGCA GCTGCTGTGG TGGGTGATCG GTATCTACAT GATGTGGTTC ACCTGCTACA AGTGTGAGAT GAGCACCGTG CCCGAGAAGG AAATCGAGGT CCTCTTCGAC GAGGATCAGG GCAAGGCCCG CTACGACGTG AGCCGTCCGT AA
|
Protein sequence | MSAMAIWLLV FVGLYWGYCI FWGIKGYLWS RTASDYFVAG RSVSMWVFIL AATATSFSGW TFVGHPGLIY EDGLQYAYAS FYSICIPFTG MLFLKRQWMI GKRWGYVTPG EMFADYFRTD SIRILILIVA LIFAVPYLGI QLRASGFLFH VLTDGWMGVE IGMWLLSAVV LFYVASGGLR AVAYVDAAQA VLLIVGIFVI GVVTLYYIGG WNNFTHVIAG LVEWETATGG AGEVSPGMVP GDSGASGYVA IPGAIQWVGA AANAQGGVWT GVMNITYMLA LGGIMASPSF TMWAFSNQNP RPFAPQQTWM SPVGVGALMF TFLAIQAMAT HGLGANTEFA KDIFSDEHGE QLAEYRTLFE ASEEHQGLVD EVRERLDAGE SLDGMDLSPL VPTAVMQRGM IQAEHPELSR QEVEQVIAAG LAALSIGEDP RNMDPDWLAA LPGDLERAWL DLSLDRGGDS ELVPQLLNML EAAAPWLAAL LAVCALAAMQ STGAAYMSTT SGMFTRDLLR RYIMPSASNQ AQVVAGRVFV TILVLAALTV ATVTTDALVL LGGLATAMGT QMWVPLAAIC FFPWLTRPGV VWGLGVGIVA VLMTENIGID LLAAAGVDVP WGRWPLTIHS AGWGLVLNAL VAVVVSAMTQ NNKEDYDHRM TVHAFLREHA SLPAEKRHLI PIAFTIVIGW WIFAFGPGAL LGNWVFGDPT NPDTWWFLGL PSIVVWQLLW WVIGIYMMWF TCYKCEMSTV PEKEIEVLFD EDQGKARYDV SRP
|
| |