Gene Hhal_0554 details

Gene Information

Locus tag: Hhal_0554
Symbol:
ID: 4709689
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhodospira halophila SL1
Kingdom: Bacteria
Replicon accession: NC_008789
Strand:
Start bp: 626608
End bp: 628089
Gene Length: 1482 bp
Protein Length: 493 aa
Translation table: 11
GC content: 66%
IMG OID: 639855012
Product: sodium/proline symporter
Protein accession: YP_001002142
Protein GI: 121997355
COG category: [E] Amino acid transport and metabolism; [R] General function prediction only
COG ID: [COG0591] Na+/proline symporter
TIGRFAM ID: [TIGR00813] transporter, SSS family; [TIGR02121] sodium/proline symporter
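
The Gene Length and Protein Length fields follow from the Start bp and End bp coordinates. A minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive (an assumption that matches the listed 1482 bp) and that the final codon is a stop codon; the function names are hypothetical:

```python
# Coordinates copied from the record above; assumed 1-based and inclusive.
START_BP = 626608
END_BP = 628089


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with inclusive start and end coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS: number of codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints "1482 493"
```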


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACGGCG AACTCGATCT CAACATCCCG CTACTGGCGA CCTTTGTCGT CTACCTGATC 
CTGATGATCC TGGTGGGGCT GTTCGCCTAC CGCTTCACCA AGAGCCTCGC CGACTACATC
CTCGGCGGGC GTCGCCTGGG TTCGGGGACG GCGGCCCTCT CGGCCGGCGC CTCGGACATG
AGCGGCTGGC TCCTGCTTGG TCTGCCCGGT GCGGTCTACG CCTCCGGGAT GAACCAGATC
TGGATTGCGG TGGGCCTGAC CATCGGCGCC TACCTCAACT GGCAGTTCAT TGCCGAGCGT
CTGCGTCGCT ACACCGAGGT GGCGCGGGAC TCGATTACCA TCCCGGCCTA CTTCGAGAAT
CGCTTCCGGG ACCAGACCTC GGCGCTGCGC GTGGTCTCGG CCCTGGTGAT CCTGCTGTTC
TTTACCTTCT ACACCTCGGC CGGTCTGGTG GCGGCGGGGA CGCTGTTCGA GGACACCTTC
GGGGTGGAGT ACACCCTGGC GCTGGCGATC GGCGCCTCGG TGATCCTGGT GTACACGCTG
CTGGGCGGTT TCCTGGCGGT TAGCTGGACC GACTTCATCC AGGGCATCCT GATGTTCGTC
GCCCTGATCC TGGTGCCGGT GATCACCGTG ATGAACCTCG GCGGCTGGTC GGAGACCGCC
GGCGCCGTCG GGGAGCTGGA GCCGGGCGCG CTGGACGCCT TCCACGACGT GACCCTGTTC
AGCATCATCT CGCTGATGGC CTGGGGCCTG GGCTACTTCG GGCAGCCCCA CGTGCTGACC
CGTTTCATGG CCATCCGCAG TCCTCGGGAT ATCCCCGCGG CCCGGTTCAT CGGCATGAGC
TGGATGGTCT TCGCCCTGTT CGGCGCCATC TTTACGGGCT TTGCCGGCAT CGCCTTCTTC
GAGGGCACTG GCACCCTGGA CAACCCCGAG ACGGTCTTCA TGGCCCTGAT CCAGGCGCTG
TTCAATCCGT GGGTGGCCGG CTGTCTGCTG GCGGCGGTGC TGGCGGCGAT CATGAGCACC
ATCGACTCCC AGCTGCTGGT CTCCTCCTCG GCCCTGTCCG AGGACTTCTA CAAGCGCTTC
CTGCGTCCGC GGGCCGGTGA CCGGGAGCTG GTCTGGGTCG GTCGCGGTAC GGTGCTCGGC
ATCGGGATCT TCGCCACCCT GCTGGCGCTC AATCCGGATG CCGCGGTCCT CGACCTGGTC
GCTTACGCCT GGGCCGGCTT CGGCGCTGCC TTCGGGCCGG TGATCATCCT CTCGGTCTTC
TGGCGCGGTG CTACGCGCAA CGGGGCCCTG GCGGGGATCA TCGTCGGCGC GGTGACGGTG
GTGGTCTGGG ATCTGCTTGA GGGCGGGCTC TTCGATATGT ACGAGATCCT GCCCGGCTTC
ATCCTGGGTA CCCTGGCGAT CCTGATCTTC AGCCGCGTTG GTGGTCGTCC GAGTGCGGAG
ATCGAGCGGG AGTTCGACAA GGTCGAACAA GGCACCCGCT AA
 
Protein sequence
MNGELDLNIP LLATFVVYLI LMILVGLFAY RFTKSLADYI LGGRRLGSGT AALSAGASDM 
SGWLLLGLPG AVYASGMNQI WIAVGLTIGA YLNWQFIAER LRRYTEVARD SITIPAYFEN
RFRDQTSALR VVSALVILLF FTFYTSAGLV AAGTLFEDTF GVEYTLALAI GASVILVYTL
LGGFLAVSWT DFIQGILMFV ALILVPVITV MNLGGWSETA GAVGELEPGA LDAFHDVTLF
SIISLMAWGL GYFGQPHVLT RFMAIRSPRD IPAARFIGMS WMVFALFGAI FTGFAGIAFF
EGTGTLDNPE TVFMALIQAL FNPWVAGCLL AAVLAAIMST IDSQLLVSSS ALSEDFYKRF
LRPRAGDREL VWVGRGTVLG IGIFATLLAL NPDAAVLDLV AYAWAGFGAA FGPVIILSVF
WRGATRNGAL AGIIVGAVTV VVWDLLEGGL FDMYEILPGF ILGTLAILIF SRVGGRPSAE
IEREFDKVEQ GTR
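
The sequence blocks above are grouped in 10-character chunks, so they need cleaning before any check against the Gene Length or GC content fields. A rough sketch of such a check follows; the helper names are hypothetical, and the demo string is only the first and last lines of the gene sequence, not the full gene:

```python
# Stop codons used by translation table 11.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# Demo input: first and last lines of the gene sequence block only.
excerpt = """
ATGAACGGCG AACTCGATCT CAACATCCCG CTACTGGCGA CCTTTGTCGT CTACCTGATC
ATCGAGCGGG AGTTCGACAA GGTCGAACAA GGCACCCGCT AA
"""
seq = clean_sequence(excerpt)
print(len(seq), round(gc_percent(seq), 1), ends_with_stop(seq))
```

Pasting the complete gene sequence block in place of the excerpt allows the cleaned length and GC percentage to be compared with the Gene Length and GC content fields of the record.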