Gene Paes_1049 details


Gene Information

Locus tag: Paes_1049
Symbol:
ID: 6459891
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prosthecochloris aestuarii DSM 271
Kingdom: Bacteria
Replicon accession: NC_011059
Strand:
Start bp: 1154267
End bp: 1155769
Gene Length: 1503 bp
Protein Length: 500 aa
Translation table: 11
GC content: 52%
IMG OID: 642725049
Product: sulphate transporter
Protein accession: YP_002015735
Protein GI: 194333875
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0659] Sulfate permease and related transporters (MFS superfamily)
TIGRFAM ID: [TIGR00815] high affinity sulphate transporter 1
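
The coordinate and length fields above are mutually consistent, which can be checked with a short calculation. The following is a minimal sketch in Python, assuming the start and end coordinates are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein length. The function names are hypothetical and chosen only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids for a CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the terminal stop codon


length = gene_length(1154267, 1155769)
print(length)                  # 1503
print(protein_length(length))  # 500
```

Both values agree with the Gene Length (1503 bp) and Protein Length (500 aa) fields of the record.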


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.272014
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000191113
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGAATATTT CAATACAGCA GACATGGCAA AGGGAGTGGT TTTCAAATAT CCGGGGGGAT 
CTGCTCGCGG GGCTTGTCGT GGCTCTGGCA CTGATACCTG AAGCAATAGC TTTTTCAATC
ATAGCCGGGG TTGATCCTAA AATAGGCTTG TACGCTTCGT TCTGTATTGC TATTGTCGTC
GCTGTTACCG GTGGACGCCC GGCAATGATT TCCGCTGCTA CAGGAGCTAT GGCGCTTCTG
ATGGTCAGTC TGGTGCGCGA CCATGGTCTT CAATATCTTT TTGCGGCGAC CATTCTGACC
GGGATACTGC AGGTTGCTGC GGGTTATCTG AAGCTTGGCA GCCTGATGCG GTTTGTGTCC
CGATCTGTTG TGACGGGGTT TGTCAACGCT CTGGCAATTC TGATTTTTAT GGCGCAGCTT
CCCGAGCTTT TCGACGTAAC CTGGCACGTC TACGCGATGA CTGCCGCTGG TCTTGCTATC
ATCTATCTCT TTCCGTACGT TCCTGTTCTG GGAAAAAATA TCCCTTCGCC GCTGGTCTGT
ATTGTCGTTG TTACCGCTGC AACGATATGG CTCGGGCTTG ATATCCGGAC CGTTGGCGAT
ATGGGCCAGC TCCCCGATAC GCTGCCGGTG TTTCTCTGGC CGGAAGTTCC ACTGAACCTT
GAGACCCTCC AGATTATTCT GCCTTATTCT GCCGCGCTTG CTGTGGTCGG GCTGCTCGAG
TCCATGATGA CAGCAACTAT TGTCGACGAT CTGACCGATA CGCCCAGCGA TAAAAATCGA
GAATGCAAGG GGCAGGGGAT TGCCAATATC GGAGCCGGAT TGCTTGGAGG CATGGCCGGG
TGCGGCATGA TCGGCCAGTC GGTGATCAAT GTGAAGTCCG GAGGGAGGGG GAGACTCTCT
TCGTTTATTG CCGGTCTGTT TCTGCTGATT ATGGTTGTGT TTCTTGGCGA TCTTCTCAAA
CAGATCCCTA TGGCGGCGCT TGTCGCGGTG ATGATCATGG TCTCTATCGG TACCTTTTCC
TGGGATTCAC TGCTCAATCT GACAAAGCAT CCTCTGTCGA CCAATATTGT TATGGTTGCA
ACGGTTATCG TGGTGGTTGC AACGCATAAT CTTGCTATCG GCGTCTTTGT CGGGGTTCTG
CTGGCGTCGC TCTTCTTTGC AAGCAAGGTC GGTCATTTTA TGATAATACG CACCGAGATG
AACGAAGCGC TTCAGAAGCG GACGTATGTG GTGACAGGAC AGGTGTTTTT CGCTTCCGCG
GACAAGTTTG TCGAAGCGTT TGATTTCAAG GAGGTGCTCC GCTGTGTCGT TATCGATCTT
ACCCATGCGC ATTTCTGGGA TATCAGTGCT GTAGCAGCGC TTGACAAAGT GGTCGTCAAG
TTTCGCCGAG AGGGAACCCA CGTAGACATT ATCGGTATGA ATGAGGCCAG TACAACAATC
GTCGATCGTT TCGGCGTGCA CGACAAGCCG GAAGAGGTTG AAAAAATTCT TGCCGGCCAT
TAA
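
The GC content field in the record (52%) is a property of the gene sequence listed above. As a rough sketch of how such a value could be recomputed, the Python function below counts G and C bases after stripping the spaces and line breaks used in the 10-base block layout. The function name `gc_content` is hypothetical, and the example call uses only the first two blocks of the sequence, so its result is not the whole-gene figure.

```python
def gc_content(sequence: str) -> float:
    """Fraction of G and C bases in a DNA sequence, ignoring whitespace."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# First two 10-base blocks of the gene sequence, for illustration only.
fragment = "GTGAATATTT CAATACAGCA"
print(f"{gc_content(fragment):.0%}")  # 30%
```

Passing the full gene sequence as one string (spaces and newlines included) would give the value for the whole CDS.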
 
Protein sequence
MNISIQQTWQ REWFSNIRGD LLAGLVVALA LIPEAIAFSI IAGVDPKIGL YASFCIAIVV 
AVTGGRPAMI SAATGAMALL MVSLVRDHGL QYLFAATILT GILQVAAGYL KLGSLMRFVS
RSVVTGFVNA LAILIFMAQL PELFDVTWHV YAMTAAGLAI IYLFPYVPVL GKNIPSPLVC
IVVVTAATIW LGLDIRTVGD MGQLPDTLPV FLWPEVPLNL ETLQIILPYS AALAVVGLLE
SMMTATIVDD LTDTPSDKNR ECKGQGIANI GAGLLGGMAG CGMIGQSVIN VKSGGRGRLS
SFIAGLFLLI MVVFLGDLLK QIPMAALVAV MIMVSIGTFS WDSLLNLTKH PLSTNIVMVA
TVIVVVATHN LAIGVFVGVL LASLFFASKV GHFMIIRTEM NEALQKRTYV VTGQVFFASA
DKFVEAFDFK EVLRCVVIDL THAHFWDISA VAALDKVVVK FRREGTHVDI IGMNEASTTI
VDRFGVHDKP EEVEKILAGH
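
The gene sequence begins with GTG while the protein sequence begins with M, which is what translation table 11 (the bacterial, archaeal and plant plastid code) gives for an alternative start codon. The sketch below illustrates this in plain Python. It builds the codon table from the NCBI-style amino acid string for table 11 and treats the first codon as an initiator that is read as methionine. The function name `translate_cds` and the set of start codons written out here are assumptions made for illustration, not part of the record.

```python
from itertools import product

# NCBI-style layout: codons enumerated with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons assumed for table 11; an initiator codon is read as M.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(sequence: str) -> str:
    """Translate a CDS, stopping at the first stop codon."""
    dna = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence.
print(translate_cds("GTGAATATTT CAATACAGCA GACATGGCAA"))  # MNISIQQTWQ
```

The output matches the first ten residues of the protein sequence listed above.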