Gene Cphamn1_1167 details


Gene Information

Locus tag: Cphamn1_1167
Symbol:
ID: 6374842
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium phaeobacteroides BS1
Kingdom: Bacteria
Replicon accession: NC_010831
Strand:
Start bp: 1254341
End bp: 1255831
Gene Length: 1491 bp
Protein Length: 496 aa
Translation table: 11
GC content: 53%
IMG OID: 642683665
Product: sulphate transporter
Protein accession: YP_001959582
Protein GI: 189500112
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0659] Sulfate permease and related transporters (MFS superfamily)
TIGRFAM ID: [TIGR00815] high affinity sulphate transporter 1
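
The coordinate and length fields in the table above can be cross-checked against each other. The following is a minimal sketch, assuming that the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table.
start_bp = 1254341
end_bp = 1255831
gene_length_bp = 1491
protein_length_aa = 496


def span_length(start, end):
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end - start + 1


def expected_protein_length(gene_len):
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


# Both checks pass for this record.
assert span_length(start_bp, end_bp) == gene_length_bp
assert expected_protein_length(gene_length_bp) == protein_length_aa
```

Under those assumptions, 1255831 - 1254341 + 1 gives 1491 bp, and 1491 / 3 - 1 gives 496 residues, so the reported fields are mutually consistent.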


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.422162
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0121281
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTGCAAA CCTTGAAAAA AGAATGGTTT TCAAATATTC GGGGCGATCT TCTGGCTGGC 
CTCGTCGTCG CCCTTGCCCT CATACCGGAA GCAATCGCCT TCTCTATCAT CGCAGGCGTA
GATCCTAAAA TAGGGCTCTA CGCCTCTTTC TGTATTTCTG TTGTGGTCGC ATTCACCGGC
GGCAGACCGG GGATGATATC CGCCGCGACC GGAGCCATGG CGCTTCTGAT GGTCACGCTG
GTCAGAGAAC ACGGTCTGCA GTACCTTTTT GCGGCCACAC TGCTGACAGG CGCTCTGCAG
ATCATCGCGG GCTACCTCAA GCTGGGGAGT CTGATGCGAT TTGTCTCGCG ATCTGTTGTA
ACTGGTTTTG TCAACGCCCT CGCCATTCTG ATTTTCATGG CCCAGCTGCC GGAACTGACC
AATGTCAGCT GGCATGTCTA TGCGCTTACC GCCGCAGGTC TCGGCATCAT TTACCTCTTT
CCGCTCATTC CATCCATTGG AAAGTCGGTT CCCTCACCGC TGGTATGCAT TGTCGTACTT
ACCGGCGTCT CGATCCTGCT TGGCCTGGAT ATCCGCACTG TGGGAGACAT GGGCGCACTT
CCGGACACAC TGCCTGTGTT CTTATGGCCG CAGATTCCGC TGACTATGGA AACCCTGTCG
ATCATTTTCC CTTATTCAGC AGCTCTTGCC GTGGTCGGCC TGCTGGAATC CATGATGACA
GCAACCATTG TCGACGATCT GACCGACACC TCGAGCGACA AGAACCGGGA GTGTAAGGGA
CAGGGAATCG CGAACATCAC CGCAGGATTG CTGGGCGGCA TGGCAGGATG CGGGATGATC
GGGCAGTCAG TCATCAACGT CAAATCGGGA GGACGAGGTC GCCTTTCCTC ATTCGTCGCA
GGCTTTTTCC TGCTCATCAT GGTGGTCTTT CTCGGTGAAT GGCTGAAACA GATCCCGATG
GCCGCCCTTG TCGCCGTGAT GATCATGGTA TCCGTCGGCA CCTTTTCCTG GGACTCACTG
ATAAAACTGA AAAAGCACCC GCTGTCGACC AATATCGTTA TGGGGGCAAC CGTGATCGTC
GTTGTCACAA CCCATAATCT GGCTATCGGA GTCTTTGTCG GGGTATTGCT GGCTTCGATG
TTTTTTGCCA GCAAGGTCGG ACATTTCATG GTGGTAAAAA GCCTTGTAGA CGAGGCTGCC
GGTTCCAGAA CATACAAAGT GATCGGCCAG GTCTTTTTCG CTTCATCCGA CAAGTTTGTT
GAGGCATTTG ACTTCAAAGA AGCTCTGGAA ACGGTTATTA TTGATCTCTC ACATGCTCAT
TTCTGGGATA TCAGCGCGGT TTCCGCTTTC GACAAGGTCG TTATCAAGTT CAGGCGCGAA
GGAACCCATG TAGAAATTAT CGGTATGAAC GAAGCAAGCG CCACCATCGT TGACCGTTTT
GGCGTACACG ACAAGCCCGA AGAGGTAGAA AAAATTCTTG CAAGTCATTA A
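
A GC content figure such as the one reported in the Gene Information table can be recomputed from a gene sequence laid out in blocks like the one above. This is a minimal sketch; `gc_percent` is a hypothetical helper, and it assumes the sequence contains only A, C, G and T plus whitespace between blocks.

```python
def gc_percent(dna):
    """Percentage of G and C bases in a DNA string, ignoring whitespace."""
    seq = "".join(dna.split()).upper()  # drop the spaces and line breaks between blocks
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Small illustration on the first block of the gene sequence.
first_block = "ATGCTGCAAA"
print(round(gc_percent(first_block)))
```

To check the full gene, paste the complete gene sequence into a string and compare the rounded result with the reported GC content value.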
 
Protein sequence
MLQTLKKEWF SNIRGDLLAG LVVALALIPE AIAFSIIAGV DPKIGLYASF CISVVVAFTG 
GRPGMISAAT GAMALLMVTL VREHGLQYLF AATLLTGALQ IIAGYLKLGS LMRFVSRSVV
TGFVNALAIL IFMAQLPELT NVSWHVYALT AAGLGIIYLF PLIPSIGKSV PSPLVCIVVL
TGVSILLGLD IRTVGDMGAL PDTLPVFLWP QIPLTMETLS IIFPYSAALA VVGLLESMMT
ATIVDDLTDT SSDKNRECKG QGIANITAGL LGGMAGCGMI GQSVINVKSG GRGRLSSFVA
GFFLLIMVVF LGEWLKQIPM AALVAVMIMV SVGTFSWDSL IKLKKHPLST NIVMGATVIV
VVTTHNLAIG VFVGVLLASM FFASKVGHFM VVKSLVDEAA GSRTYKVIGQ VFFASSDKFV
EAFDFKEALE TVIIDLSHAH FWDISAVSAF DKVVIKFRRE GTHVEIIGMN EASATIVDRF
GVHDKPEEVE KILASH
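
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below assumes the codon-to-amino-acid assignments of translation table 11, which are the same as those of the standard genetic code; it does not handle the alternative start codons that table 11 allows, which is sufficient here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with the standard amino-acid string.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # remove block spacing and line breaks
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence give the first 10 residues.
print(translate("ATGCTGCAAA CCTTGAAAAA AGAATGGTTT"))  # MLQTLKKEWF
```

Applying `translate` to the whole gene sequence and comparing the result with the protein sequence (with spaces removed) is a quick way to confirm the two blocks agree.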