Gene Cagg_1147 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_1147 
Symbol 
ID7267895 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp1415059 
End bp1416825 
Gene Length1767 bp 
Protein Length588 aa 
Translation table11 
GC content55% 
IMG OID643565990 
Productsulfate transporter 
Protein accessionYP_002462493 
Protein GI219848060 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0659] Sulfate permease and related transporters (MFS superfamily) 
TIGRFAM ID[TIGR00377] anti-anti-sigma factor
[TIGR00815] high affinity sulphate transporter 1 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00301297 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000165029 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGATGCCCG TTTCTGCGCG CCAACCGGGA GTGCAGGCAG CGTTGCTGAT GATGATTACA 
CGCTACCTGC CATTCCTGAA CTGGTTACGT ACCTACCGGC TCGAGCATCT GCCTAGCGAC
ATAGTGGCGG GAATCGTTAC CGCCATTATG CTCATTCCGC AGAGCATGGC TTACGCCCAG
TTGGCCGGTC TTCCACCGCA GGTTGGTCTG TACGCATCGG TCGCGCCACT GATTGTCTAC
GCCCTCCTCG GTACCTCAGG TCAACTTTCG GTAGGTCCGG TCGCGATCAC GTCCCTCCTC
GTGTTCAACG GGGTGAGTGC GTTGGCTGTG CCGGGTACTG AGCGCTATTT TCAGTTGGTG
CTCTTGCTGG CCTTTATGGT TGGTGCGATC AAATTGGCTT TGGGGATATT CAGACTGGGT
GTGATCCTTA ACTTCATTTC ACATCCGGTG TTGGCCGCGT TTACCAGTGC CTCAGCACTG
ATTATTGCGG TGGGTCAATT GAAATATATT CTCGGTTATC GGATCGGTGG TGAACATATC
TACGAAACGA TTGCACAAGC AATCGCCGGT CTGAGCCAGA CCAACGTTGC CACGTTGGTG
ATTGGTTTGG CAAGTATTGG TTTGTTACTC TTCTTTCGGC AGGGGCTGCG TCCGTTGTTG
CGCCGGGCCG GGTTATCACC GCTAGCAGTC ACCTTAATCG TGAGTGGTGC GCCACTGTTG
GCGGTGATTT TTGGTATTCT AGTGGCACAA GCATTCCGCC TCGATCAAGT TGCCGGTGTC
GCGGTTGTAG GGACGATCCC GCCCGGCCTA TCACCGATTA GCTCACCTGT CTTAACCATA
GCCGACGCAC AAGCCTTACT GCCGACAGCT CTCACGATTG TGTTGGTCAG TGTGGTTGAG
TCGATAGCCG TCGCTAAGGC ATTGGCCAGC AAGCGTCGTC AAGCAATTGA TCCCGATCAG
GAATTGGTTG CGCTTGGTGC TGCCAATATT GCGGCCGGCT TTTTTAGCGG TTACCCGGTG
ACGGGTGGGT TTGCTCGCTC GGTGGTCAAT GCGCAGGCCG GTGCGATCAC CGGTTTGGCA
TCGTTGATCA CTGCCGCCAT GATTGCGCTG ATCCTGCTCT TCTTTACGTC GGTCTTCTAT
TACCTACCGC AGGCAGTGTT GGCGGCAACC GTGATCGTAG CCGTCATCGG GCTGGTTGAT
CTGCATGAGC CGCAGCAAAT CTGGCGCACT AATCGCGGGG ATGCCTTTAC ATGGCTTATT
ACGTTTGTGG CGGTGCTGGC TCTTGGGATT GAGACCGGTA TCTTTGCCGG CGTCGCTTCG
GCGCTCATTC TTTACCTCTG GCGTACTAGC CGTCCGCATA TTGCCATCGT TGGTCGGCTG
GGGAACAGTG AAGTCTACCG CAACGTCGAG CGTCATCCGG TCAAGACATG GCCGCACGTA
GTGGCCGTCC GTGTTGACGA GAGCATCTAT TTCGCCAATA CGCGCTATCT CGAGCAGACG
TTATTGCGGA TTGTGGCTGA ACGACCTGAG GTGAAGCATT TGGTGTTGAT CGGTTCGGCC
ATTAATTTTA TCGACTCGAG CGCCTTGCAT ACTCTCCATA ACTTAATTGA TGGTCTGCGC
GATGCCGGTG TCGAGTTTCA TTTGGCCGAT ATTAAAGGAC CGGTCATGGA TCGGCTCAAG
CGGTCGGAAT TGCTCGATAA GATCGGGCAA GATCACATCC ACCTGACAAC GCACTCGGCG
ATGCTAGCGT TGGGTTGCCG AGATTGA
 
Protein sequence
MMPVSARQPG VQAALLMMIT RYLPFLNWLR TYRLEHLPSD IVAGIVTAIM LIPQSMAYAQ 
LAGLPPQVGL YASVAPLIVY ALLGTSGQLS VGPVAITSLL VFNGVSALAV PGTERYFQLV
LLLAFMVGAI KLALGIFRLG VILNFISHPV LAAFTSASAL IIAVGQLKYI LGYRIGGEHI
YETIAQAIAG LSQTNVATLV IGLASIGLLL FFRQGLRPLL RRAGLSPLAV TLIVSGAPLL
AVIFGILVAQ AFRLDQVAGV AVVGTIPPGL SPISSPVLTI ADAQALLPTA LTIVLVSVVE
SIAVAKALAS KRRQAIDPDQ ELVALGAANI AAGFFSGYPV TGGFARSVVN AQAGAITGLA
SLITAAMIAL ILLFFTSVFY YLPQAVLAAT VIVAVIGLVD LHEPQQIWRT NRGDAFTWLI
TFVAVLALGI ETGIFAGVAS ALILYLWRTS RPHIAIVGRL GNSEVYRNVE RHPVKTWPHV
VAVRVDESIY FANTRYLEQT LLRIVAERPE VKHLVLIGSA INFIDSSALH TLHNLIDGLR
DAGVEFHLAD IKGPVMDRLK RSELLDKIGQ DHIHLTTHSA MLALGCRD