Gene Information |
Locus tag | Cfla_0471 |
Symbol | |
ID | 9144337 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cellulomonas flavigena DSM 20109 |
Kingdom | Bacteria |
Replicon accession | NC_014151 |
Strand | + |
Start bp | 500616 |
End bp | 502244 |
Gene Length | 1629 bp |
Protein Length | 542 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | sulphate transporter |
Protein accession | YP_003635585 |
Protein GI | 296128335 |
COG category | |
COG ID | |
TIGRFAM ID | |
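The coordinate and length fields above can be cross-checked against each other. The sketch below is only a consistency check, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon. The variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 500616
end_bp = 502244
protein_length_aa = 542

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# A complete CDS has one codon per residue plus one terminal stop codon.
codon_count = gene_length_bp // 3
expected_protein_length = codon_count - 1

print(gene_length_bp)           # 1629
print(gene_length_bp % 3)       # 0, i.e. a whole number of codons
print(expected_protein_length)  # 542
```

Both derived values agree with the listed Gene Length (1629 bp) and Protein Length (542 aa).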
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGCCCGACG CCCCCCGCCC GCCCCTGCCC TCCGGCAGCG GCGCCGACGA CGCGCCGGAC GACGTCGCCC CGGCGCCCCG CTCGACCGCC GCCGGCACGC CCGCCGCCGA CGGCGCCACT CACGCCACCC CTGCCCCCGG CCCGCCCGAG GCCGACACGT CGCACTCCGT CCTCGCCGCC CTGCGCTCGC CGCGGCGCCT GCGGACCGAG CTGCTCGCCG GGCTCGTCGT GGCGCTCGCG CTCATCCCCG AGGCGATCGC GTTCTCCGTC ATCGCGGGCG TCGACCCGCG CGTGGGCCTG TTCGCGTCGT TCACGATGGC CGTGAGCATC GCGTTCCTCG GCGGACGGCC CGCGATGATC TCCGCGGCGA CCGGTGCCGT CGCGCTGGTC GTGGCCCCGG TTGCCCCGCG CCACGGGCTC GACTTCCTCC TGGCGACGAT CGTGCTGGCC GGTGTGCTCC AGGTGCTGCT CGGGCTGCTG GGCGTCGCAC GGCTGATGCG GTTCGTGCCG CGCTCGGTGA TGGTCGGGTT CGTCAACGCG CTGGCGATGC TCGTGTTCAT GTCGCAGGTG CCGCACCTGA CGGGCGTGCC GTTCCTCGTC TACCCGCTGG TCGCCGCCGG CGTCGTCGTG ATCGTGCTGC TGCCACGCTG GACGACGGTC GTGCCGGCGC CCCTCGTCGC GGTCGTCCTG CTCACCGCCG CCACCGTCCT GGGCGCGCTG CAGGTCCCGA CGGTCGGCGA CGAGGGCGAG CTGCCGGAGT CCCTGCCGGT GCTCCACGTG CCCGACGTGC CGTTCACGCT CGAGACCCTG CGGATCATCG CGCCGTACGC GCTCGCGGTC GCGCTCGTCG GCCTCCTGGA GTCGCTGCTG ACCGCCAAGC TCGTCGACGA CGTCACCGAC ACGCACTCGG ACAAGACGCG CGAGGCGTGG GGCCAGGGCG GCGCCAACAT CGTCACCGGC ATGCTCGGCG GCATGGGCGG CTGCGCCGTC ATCGGCCAGA CGATGATGAA CGTCAAGATC TCCGGCGCCC GCACGCGCAT CTCGACGTTC CTTGCCGGGG TCTTCCTGCT CGTCCTCGTC GTGGGGCTCG GCGACGTCGT CGCGGTCGTG CCGATGGCCG CGCTGGTCGC GGTGATGATC ATGGTGTCCG TCGGTGCGTT CGACTGGCAC TCGGTCCACC CGCGCACGCT GCGCCGCATG CCCCGCTCGG AGACGGCCGT GATGCTCACG ACCGTGCTCG TCACGGTCGT GTCGCACAAC CTCGCGTTCG GCGTCGGTGC GGGCGTGCTG CTGGCGACCC TGCTGTTCGT GCGGCGCGTC GCGCACGTCA CCACGGTCAC ACGGCTCGAC GGCGACGACG ACGGGCCGCG CGTGTACGCC GTCGAGGGTG CGCTGTTCTT CGCGTCGTCC AACGACCTCG TCTACCGGTT CGACTACGCC GGGGACCCGC AGGACGTCGT CATCGACCTG TCGAAGGCGC ACGTGTGGGA CGCGTCGGCC GTCGCCACGC TCGACGCGAT CCGCCACAAG TACGCGTCGA AGGGCAAGAC CGTGACGATC GTGGGCACGG ACCCGGTCAG CGCGGAGCGC ATGGTGCGGA TGGCGGGGGA GCCGGGCGGC GGGCACTGA
Protein sequence | MPDAPRPPLP SGSGADDAPD DVAPAPRSTA AGTPAADGAT HATPAPGPPE ADTSHSVLAA LRSPRRLRTE LLAGLVVALA LIPEAIAFSV IAGVDPRVGL FASFTMAVSI AFLGGRPAMI SAATGAVALV VAPVAPRHGL DFLLATIVLA GVLQVLLGLL GVARLMRFVP RSVMVGFVNA LAMLVFMSQV PHLTGVPFLV YPLVAAGVVV IVLLPRWTTV VPAPLVAVVL LTAATVLGAL QVPTVGDEGE LPESLPVLHV PDVPFTLETL RIIAPYALAV ALVGLLESLL TAKLVDDVTD THSDKTREAW GQGGANIVTG MLGGMGGCAV IGQTMMNVKI SGARTRISTF LAGVFLLVLV VGLGDVVAVV PMAALVAVMI MVSVGAFDWH SVHPRTLRRM PRSETAVMLT TVLVTVVSHN LAFGVGAGVL LATLLFVRRV AHVTTVTRLD GDDDGPRVYA VEGALFFASS NDLVYRFDYA GDPQDVVIDL SKAHVWDASA VATLDAIRHK YASKGKTVTI VGTDPVSAER MVRMAGEPGG GH
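The sequences above are printed in space-separated groups of ten characters, which must be removed before any analysis. The following is a minimal sketch, using only short excerpts from the start and end of the gene sequence, of how the groups might be joined and a few basic properties checked. The helper names `clean_sequence` and `gc_fraction` are hypothetical. The stop codon set is the one used by translation table 11.

```python
def clean_sequence(grouped: str) -> str:
    """Join the whitespace-separated groups into one contiguous string."""
    return "".join(grouped.split())


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    seq = seq.upper()
    return (seq.count("G") + seq.count("C")) / len(seq)


# Excerpts only: the first three and the last two groups of the gene sequence.
head = clean_sequence("ATGCCCGACG CCCCCCGCCC GCCCCTGCCC")
tail = clean_sequence("GCCGGGCGGC GGGCACTGA")

# Stop codons in translation table 11 (same as the standard code).
STOP_CODONS = {"TAA", "TAG", "TGA"}

has_start = head.startswith("ATG")
has_stop = tail[-3:] in STOP_CODONS

print(has_start, has_stop)
print(round(gc_fraction(head), 3))
```

The GC fraction printed here describes the 30-base excerpt only. Reproducing the 73% figure in the record would require passing the full gene sequence to `gc_fraction`.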