Gene Hhal_1899 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_1899 
Symbol 
ID4710677 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp2088879 
End bp2091170 
Gene Length2292 bp 
Protein Length763 aa 
Translation table11 
GC content65% 
IMG OID639856372 
ProductNa+/solute symporter 
Protein accessionYP_001003465 
Protein GI121998678 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG0591] Na+/proline symporter 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCAGCTA TGGCCATATG GCTGTTGGTG TTTGTAGGCC TGTACTGGGG CTACTGCATC 
TTCTGGGGTA TCAAGGGATA CCTTTGGTCG CGCACAGCCA GCGACTACTT CGTGGCCGGG
CGCTCCGTCA GTATGTGGGT CTTCATACTG GCGGCAACGG CCACATCGTT CTCAGGGTGG
ACGTTCGTAG GACACCCGGG CCTGATCTAC GAGGACGGCC TGCAGTACGC CTACGCCTCG
TTCTACTCCA TCTGCATCCC GTTCACCGGC ATGCTGTTCC TCAAGCGCCA ATGGATGATT
GGCAAGCGTT GGGGTTACGT CACGCCGGGC GAGATGTTCG CGGACTACTT CCGGACGGAC
TCGATCCGCA TCCTGATCCT CATTGTGGCG CTGATCTTCG CAGTGCCGTA CCTGGGGATT
CAGCTGCGCG CCTCGGGCTT CCTGTTCCAC GTGCTCACCG ACGGCTGGAT GGGCGTCGAG
ATCGGCATGT GGCTGCTGTC CGCGGTGGTG CTGTTCTACG TGGCCTCGGG CGGTCTGCGC
GCCGTGGCGT ACGTGGACGC GGCGCAGGCT GTTTTGCTCA TTGTGGGCAT CTTCGTCATC
GGCGTCGTGA CGCTCTACTA CATCGGCGGC TGGAACAACT TCACGCACGT GATTGCCGGC
CTGGTCGAGT GGGAGACCGC CACCGGTGGT GCGGGTGAGG TCTCGCCGGG CATGGTGCCG
GGGGATAGCG GTGCCAGTGG CTATGTCGCC ATCCCGGGTG CCATCCAGTG GGTCGGTGCC
GCGGCCAACG CCCAGGGCGG TGTCTGGACC GGCGTCATGA ACATCACCTA CATGCTGGCG
CTGGGCGGCA TCATGGCCTC CCCGTCGTTC ACCATGTGGG CGTTCTCGAA CCAGAACCCG
CGTCCGTTTG CGCCGCAACA GACCTGGATG TCCCCGGTGG GCGTCGGTGC CCTGATGTTC
ACCTTCCTGG CGATCCAGGC GATGGCCACC CACGGCCTGG GGGCCAACAC CGAGTTCGCC
AAGGACATCT TCTCCGACGA GCACGGTGAG CAGCTGGCCG AGTACCGGAC GCTGTTCGAG
GCCTCCGAGG AGCACCAAGG GCTGGTGGAC GAGGTGCGCG AGCGCCTGGA CGCCGGCGAG
TCCCTGGACG GCATGGATCT CAGCCCGCTG GTACCCACCG CGGTCATGCA GCGCGGCATG
ATCCAGGCCG AGCATCCGGA ACTCAGTCGT CAGGAGGTCG AGCAGGTCAT CGCGGCCGGT
CTGGCGGCGC TGTCCATCGG TGAGGATCCG CGCAACATGG ATCCGGACTG GCTGGCAGCT
CTGCCGGGCG ATCTCGAGCG GGCCTGGCTG GACCTGTCCC TGGATCGGGG CGGTGACAGC
GAGCTTGTGC CCCAGCTGCT GAACATGCTG GAGGCGGCGG CGCCGTGGCT GGCGGCCCTG
CTGGCGGTCT GTGCCCTGGC GGCCATGCAG TCCACCGGCG CGGCGTACAT GTCCACCACC
AGTGGCATGT TTACCCGTGA CCTGCTGCGT CGCTACATCA TGCCCAGTGC CAGCAACCAG
GCCCAGGTGG TGGCGGGCCG GGTCTTCGTC ACCATCCTGG TGCTTGCGGC CCTGACCGTG
GCGACGGTGA CCACCGACGC CCTGGTGCTG CTCGGCGGTC TGGCCACCGC CATGGGTACG
CAGATGTGGG TGCCGCTGGC CGCCATCTGC TTCTTCCCCT GGCTTACCCG TCCGGGTGTG
GTCTGGGGGC TGGGCGTTGG CATCGTTGCC GTGCTGATGA CCGAGAACAT CGGCATCGAC
CTGCTGGCGG CGGCGGGCGT CGACGTGCCC TGGGGCCGTT GGCCGCTGAC CATCCACTCT
GCCGGGTGGG GCCTGGTGCT TAACGCCCTG GTGGCCGTGG TCGTCTCGGC GATGACGCAG
AACAACAAGG AAGACTACGA TCACCGCATG ACGGTGCACG CCTTCCTTCG CGAGCATGCG
TCACTGCCGG CCGAGAAGCG CCACCTGATC CCGATCGCCT TCACCATCGT CATCGGCTGG
TGGATCTTCG CCTTCGGTCC GGGCGCCCTG CTGGGCAACT GGGTCTTCGG TGATCCGACC
AACCCCGATA CCTGGTGGTT CCTCGGCCTG CCGTCCATCG TGGTCTGGCA GCTGCTGTGG
TGGGTGATCG GTATCTACAT GATGTGGTTC ACCTGCTACA AGTGTGAGAT GAGCACCGTG
CCCGAGAAGG AAATCGAGGT CCTCTTCGAC GAGGATCAGG GCAAGGCCCG CTACGACGTG
AGCCGTCCGT AA
 
Protein sequence
MSAMAIWLLV FVGLYWGYCI FWGIKGYLWS RTASDYFVAG RSVSMWVFIL AATATSFSGW 
TFVGHPGLIY EDGLQYAYAS FYSICIPFTG MLFLKRQWMI GKRWGYVTPG EMFADYFRTD
SIRILILIVA LIFAVPYLGI QLRASGFLFH VLTDGWMGVE IGMWLLSAVV LFYVASGGLR
AVAYVDAAQA VLLIVGIFVI GVVTLYYIGG WNNFTHVIAG LVEWETATGG AGEVSPGMVP
GDSGASGYVA IPGAIQWVGA AANAQGGVWT GVMNITYMLA LGGIMASPSF TMWAFSNQNP
RPFAPQQTWM SPVGVGALMF TFLAIQAMAT HGLGANTEFA KDIFSDEHGE QLAEYRTLFE
ASEEHQGLVD EVRERLDAGE SLDGMDLSPL VPTAVMQRGM IQAEHPELSR QEVEQVIAAG
LAALSIGEDP RNMDPDWLAA LPGDLERAWL DLSLDRGGDS ELVPQLLNML EAAAPWLAAL
LAVCALAAMQ STGAAYMSTT SGMFTRDLLR RYIMPSASNQ AQVVAGRVFV TILVLAALTV
ATVTTDALVL LGGLATAMGT QMWVPLAAIC FFPWLTRPGV VWGLGVGIVA VLMTENIGID
LLAAAGVDVP WGRWPLTIHS AGWGLVLNAL VAVVVSAMTQ NNKEDYDHRM TVHAFLREHA
SLPAEKRHLI PIAFTIVIGW WIFAFGPGAL LGNWVFGDPT NPDTWWFLGL PSIVVWQLLW
WVIGIYMMWF TCYKCEMSTV PEKEIEVLFD EDQGKARYDV SRP