Gene Cfla_0471 details


Gene Information

Locus tag: Cfla_0471
Symbol:
ID: 9144337
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cellulomonas flavigena DSM 20109
Kingdom: Bacteria
Replicon accession: NC_014151
Strand:
Start bp: 500616
End bp: 502244
Gene Length: 1629 bp
Protein Length: 542 aa
Translation table: 11
GC content: 73%
IMG OID:
Product: sulphate transporter
Protein accession: YP_003635585
Protein GI: 296128335
COG category:
COG ID:
TIGRFAM ID:
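
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. Both assumptions are consistent with the listed values but are not stated in the record. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(500616, 502244)
    print(length, protein_length(length))  # prints: 1629 542
```

Under those assumptions, the result agrees with the Gene Length (1629 bp) and Protein Length (542 aa) fields.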


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCGACG CCCCCCGCCC GCCCCTGCCC TCCGGCAGCG GCGCCGACGA CGCGCCGGAC 
GACGTCGCCC CGGCGCCCCG CTCGACCGCC GCCGGCACGC CCGCCGCCGA CGGCGCCACT
CACGCCACCC CTGCCCCCGG CCCGCCCGAG GCCGACACGT CGCACTCCGT CCTCGCCGCC
CTGCGCTCGC CGCGGCGCCT GCGGACCGAG CTGCTCGCCG GGCTCGTCGT GGCGCTCGCG
CTCATCCCCG AGGCGATCGC GTTCTCCGTC ATCGCGGGCG TCGACCCGCG CGTGGGCCTG
TTCGCGTCGT TCACGATGGC CGTGAGCATC GCGTTCCTCG GCGGACGGCC CGCGATGATC
TCCGCGGCGA CCGGTGCCGT CGCGCTGGTC GTGGCCCCGG TTGCCCCGCG CCACGGGCTC
GACTTCCTCC TGGCGACGAT CGTGCTGGCC GGTGTGCTCC AGGTGCTGCT CGGGCTGCTG
GGCGTCGCAC GGCTGATGCG GTTCGTGCCG CGCTCGGTGA TGGTCGGGTT CGTCAACGCG
CTGGCGATGC TCGTGTTCAT GTCGCAGGTG CCGCACCTGA CGGGCGTGCC GTTCCTCGTC
TACCCGCTGG TCGCCGCCGG CGTCGTCGTG ATCGTGCTGC TGCCACGCTG GACGACGGTC
GTGCCGGCGC CCCTCGTCGC GGTCGTCCTG CTCACCGCCG CCACCGTCCT GGGCGCGCTG
CAGGTCCCGA CGGTCGGCGA CGAGGGCGAG CTGCCGGAGT CCCTGCCGGT GCTCCACGTG
CCCGACGTGC CGTTCACGCT CGAGACCCTG CGGATCATCG CGCCGTACGC GCTCGCGGTC
GCGCTCGTCG GCCTCCTGGA GTCGCTGCTG ACCGCCAAGC TCGTCGACGA CGTCACCGAC
ACGCACTCGG ACAAGACGCG CGAGGCGTGG GGCCAGGGCG GCGCCAACAT CGTCACCGGC
ATGCTCGGCG GCATGGGCGG CTGCGCCGTC ATCGGCCAGA CGATGATGAA CGTCAAGATC
TCCGGCGCCC GCACGCGCAT CTCGACGTTC CTTGCCGGGG TCTTCCTGCT CGTCCTCGTC
GTGGGGCTCG GCGACGTCGT CGCGGTCGTG CCGATGGCCG CGCTGGTCGC GGTGATGATC
ATGGTGTCCG TCGGTGCGTT CGACTGGCAC TCGGTCCACC CGCGCACGCT GCGCCGCATG
CCCCGCTCGG AGACGGCCGT GATGCTCACG ACCGTGCTCG TCACGGTCGT GTCGCACAAC
CTCGCGTTCG GCGTCGGTGC GGGCGTGCTG CTGGCGACCC TGCTGTTCGT GCGGCGCGTC
GCGCACGTCA CCACGGTCAC ACGGCTCGAC GGCGACGACG ACGGGCCGCG CGTGTACGCC
GTCGAGGGTG CGCTGTTCTT CGCGTCGTCC AACGACCTCG TCTACCGGTT CGACTACGCC
GGGGACCCGC AGGACGTCGT CATCGACCTG TCGAAGGCGC ACGTGTGGGA CGCGTCGGCC
GTCGCCACGC TCGACGCGAT CCGCCACAAG TACGCGTCGA AGGGCAAGAC CGTGACGATC
GTGGGCACGG ACCCGGTCAG CGCGGAGCGC ATGGTGCGGA TGGCGGGGGA GCCGGGCGGC
GGGCACTGA
 
Protein sequence
MPDAPRPPLP SGSGADDAPD DVAPAPRSTA AGTPAADGAT HATPAPGPPE ADTSHSVLAA 
LRSPRRLRTE LLAGLVVALA LIPEAIAFSV IAGVDPRVGL FASFTMAVSI AFLGGRPAMI
SAATGAVALV VAPVAPRHGL DFLLATIVLA GVLQVLLGLL GVARLMRFVP RSVMVGFVNA
LAMLVFMSQV PHLTGVPFLV YPLVAAGVVV IVLLPRWTTV VPAPLVAVVL LTAATVLGAL
QVPTVGDEGE LPESLPVLHV PDVPFTLETL RIIAPYALAV ALVGLLESLL TAKLVDDVTD
THSDKTREAW GQGGANIVTG MLGGMGGCAV IGQTMMNVKI SGARTRISTF LAGVFLLVLV
VGLGDVVAVV PMAALVAVMI MVSVGAFDWH SVHPRTLRRM PRSETAVMLT TVLVTVVSHN
LAFGVGAGVL LATLLFVRRV AHVTTVTRLD GDDDGPRVYA VEGALFFASS NDLVYRFDYA
GDPQDVVIDL SKAHVWDASA VATLDAIRHK YASKGKTVTI VGTDPVSAER MVRMAGEPGG
GH
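
The sequence blocks above are printed in groups of ten characters, so the whitespace has to be removed before they are used. The following is a minimal sketch, not the database's own tooling, of how the gene sequence could be cleaned, its GC percentage computed, and its translation compared against the protein sequence. The helper names `clean`, `gc_percent`, and `translate` are hypothetical. The codon assignments are those of the standard genetic code, which translation table 11 shares. Table 11 differs in its permitted start codons, which is not an issue here because the gene begins with ATG.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code, shared by table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a non-empty nucleotide sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for pos in range(0, len(s) - 2, 3):
        amino_acid = CODON_TABLE[s[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


if __name__ == "__main__":
    first_blocks = "ATGCCCGACG CCCCCCGCCC GCCCCTGCCC"
    print(translate(first_blocks))  # prints: MPDAPRPPLP
```

Passing the full gene sequence to `translate` and the cleaned protein sequence to a string comparison would test whether the two blocks agree. Passing the full gene sequence to `gc_percent` gives a value that can be compared with the GC content field.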