Gene Hhal_1911 details


Gene Information

Locus tag: Hhal_1911
Symbol:
ID: 4710802
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhodospira halophila SL1
Kingdom: Bacteria
Replicon accession: NC_008789
Strand:
Start bp: 2106791
End bp: 2108284
Gene Length: 1494 bp
Protein Length: 497 aa
Translation table: 11
GC content: 64%
IMG OID: 639856384
Product: sulphate transporter
Protein accession: YP_001003477
Protein GI: 121998690
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0659] Sulfate permease and related transporters (MFS superfamily)
TIGRFAM ID: [TIGR00815] high affinity sulphate transporter 1
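
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2106791
end_bp = 2108284

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and encodes no residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1494
print(protein_length)  # 497
```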


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.42059
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACTGGG ATCGGATTCA ACGGGAGTGG TTCGGCAACG TGCGGGCCGA CGTGCTGGCG 
GGCATCATCG TTGCGCTGGC GTTGATCCCT GAGGCGATTG CCTTTTCGAT CATGGCGGGC
GTCGACCCGC AGGTGGGGCT CTACGCCTCG TTCTGCATCG CCGTGGTGAT CGCCTTCACC
GGCGGCCGGC CCGGCATGAT CTCCGGGGCG ACGGCGGCGA TGGCGCTGCT GATGATCACC
CTGGTCAAGG ACCACGGGCT GGAGTACCTG CTGTTCGCTA CGCTGCTGAC CGGCGTGCTG
CAGATCATCG CCGGCTACCT GCGGGTCGGG CAGTTGATGC GCTTTATCTC CAACTCGGTG
ATGACCGGGT TCGTGAACGC CCTGGCGATC CTCATCTTCA TGGCGCAGCT ACCCGAGCTG
ATCGATGTCC CCTGGGCGGT GTACCCCATG GTCGCCGGTG GCCTGGCGCT GATCTATCTG
CTCCCCTACC TGACGCGGCT GGTGCCGTCG CCGCTGGTCA CCATCGTGGT GCTGACCACG
ATCGCCTGGT TCCTCGGTAT GGACGTCCCC ACCGTCGGTG ATATGGGCAA GTTGCCGGAT
ACCCTGCCGG TCTTCCTCTG GCCCGACGTC CCGTTGAACC TAGAGACGTT GTGGATCGTC
CTGCCTTATG CCTTGGCGCT GACCGTGGTG GGGCTGCTCG AGTCGATGAT GACCCAGGGG
ATCGTCGACA ACCTGACCGA TACGGAGACG GACCGCGACC GCGAGTGCAA GGGGCAGGGC
ATTGCCAACA TCGCCGCCGG TAGTGTGGGC GGCATGGCCG GCTGCGCGAT GATCGGTCAG
TCCATCATCA ATGTGAAATC CGGTGGCCGG GGGCGGCTGT CCTCGCTGGT GGCCGGTGTC
GTCCTGCTGG TCCTGGTGGT CTTCCTCACC CCGCTGCTGG AGATGATCCC CATGGCCGCC
CTGGTTGCCG TGATGATCAT GGTGGCCATC GGTACCTTTA GTTGGCGGTC GCTTCGCGAT
ATGAAGGACC ACCCGCTGAG TACCAACATC GTCATGGTCA GCATGGTGGG CGTGACCGTG
TTCACCCACA ATCTGGCCAT CGGCGTGCTC ACCGGCGTGC TCCTCGCCTC GCTGTTCTTC
GCCAACAAGG TCAGTCGCTT TATGTACGTC CGCTCCGACG TGCAGTCGCT GCCCGATGAC
CAGACTCTGC ACCGGCGCTA CGAGGTGGTC GGCCAAGTCT TCTTCGCCTC CGCCGAGCGC
TTCCGGGAGT CCTTCGACTT CGGCGAGGAC CTGGATACGG TCACCATCGA CCTGACCCGG
GCGCACTTCT GGGACATTAC GTCGGTGGCA GCGCTTGACC GGGTCGTCAT AAACTTCCGC
CGCGAAGGCG TTGAGGTGGA GATCGTCGGC ATGAATGAGG CGACGGCCAC CATCGTTGAT
CGCTACGCGG TCCATAACGA GCCGGACGCC GTCGAGCGGC TGATGGGCCA CTGA
 
Protein sequence
MNWDRIQREW FGNVRADVLA GIIVALALIP EAIAFSIMAG VDPQVGLYAS FCIAVVIAFT 
GGRPGMISGA TAAMALLMIT LVKDHGLEYL LFATLLTGVL QIIAGYLRVG QLMRFISNSV
MTGFVNALAI LIFMAQLPEL IDVPWAVYPM VAGGLALIYL LPYLTRLVPS PLVTIVVLTT
IAWFLGMDVP TVGDMGKLPD TLPVFLWPDV PLNLETLWIV LPYALALTVV GLLESMMTQG
IVDNLTDTET DRDRECKGQG IANIAAGSVG GMAGCAMIGQ SIINVKSGGR GRLSSLVAGV
VLLVLVVFLT PLLEMIPMAA LVAVMIMVAI GTFSWRSLRD MKDHPLSTNI VMVSMVGVTV
FTHNLAIGVL TGVLLASLFF ANKVSRFMYV RSDVQSLPDD QTLHRRYEVV GQVFFASAER
FRESFDFGED LDTVTIDLTR AHFWDITSVA ALDRVVINFR REGVEVEIVG MNEATATIVD
RYAVHNEPDA VERLMGH
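
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. A minimal sketch of that mapping is shown below for the first 30 bases only. It assumes the standard codon assignments (translation table 11 shares them for elongation codons) and uses a hypothetical helper named `translate`; it is not the tool that produced this record.

```python
# Build a codon table from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0; '*' marks a stop codon."""
    seq = dna.replace(" ", "").upper()
    usable = len(seq) - len(seq) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First three 10-base blocks of the gene sequence above.
first_blocks = "ATGAACTGGG ATCGGATTCA ACGGGAGTGG"
print(translate(first_blocks))  # MNWDRIQREW
```

The output matches the first ten residues of the protein sequence listed above.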