Gene Cfla_2042 details


Gene Information

Locus tag: Cfla_2042
Symbol:
ID: 9145938
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cellulomonas flavigena DSM 20109
Kingdom: Bacteria
Replicon accession: NC_014151
Strand:
Start bp: 2278809
End bp: 2280503
Gene Length: 1695 bp
Protein Length: 564 aa
Translation table: 11
GC content: 76%
IMG OID:
Product: sulphate transporter
Protein accession: YP_003637136
Protein GI: 296129886
COG category:
COG ID:
TIGRFAM ID:
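The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; the function names are hypothetical and not part of the database.

```python
# Values copied from the record above.
START_BP = 2278809
END_BP = 2280503


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length)                  # 1695
    print(protein_length(length))  # 564
```

Both results match the Gene Length (1695 bp) and Protein Length (564 aa) fields of the record.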


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGACC TCGCCCCCGT GGTCCGGCGC ATCCGCGCCC TCGCCCCCCA CCGCGACGAC 
TACGCCGACC TGCCCCGCTC GTGGCGCGCC GACCTGCTCG CCGGCGTCAC CGTCGCCGTC
GTCGCGCTGC CCCTCGCCCT CGGCTTCGGC GTGGCCTCCG GGCTCGGCGC CGCCGCCGGG
CTCGTCACCG CCGTGGTGGC CGGCGCGATC GCCGCCGTCC TCGGCGGCTC CCACCTGCAG
GTCAGCGGAC CGACGGGGGC GATGACGGTC GTCCTGCTGC CCGTCGTCGC CCGGCACGGC
GCCGAAGCCG TGCCGATCGT GGCGATCATG GCCGGCGGCA TCGTGATCCT CGCCGGTGTG
CTCGGCATCG GCCGGCTCGT CGCGTACATC CCGTGGCCCG TCGTCGAAGG GTTCACGTGC
GGCATCGGCG TCGTCATCGC GCTGCAGCAG GTGCCGCTCG CGCTGGACAC TCCGCGCGCC
GAGGGCGAGA ACGCCGGGCT CGTCGCGCTG CGCACCCTCG GCGACGTCGA CTGGTCGCAG
GCCGTCGCGC CGCTCGCGCT CGTCGCGCTC GTCGTCGCCG TCATGCTGGC CCTCCCGCGG
CTGCGCCAGG GGCTGCCCGC CTCGCTCGTG GCCGTCGTGC TCGCCACCGC CGTCGCCGAG
GCCGCCCGCC TCGACGTCGA CCGCATCGGC GTCCTGCCCT CGGCGCTGCC CGCCCCCCAC
CTGCCGATCG TGGACCTGGC CACCACCAGC GCGCTGTTCT CCGCCGCCCT CGCGGTCGCT
GCGCTCGCCG CCCTGGAGTC GCTGCTGTCG GCGCGCGTCG CGGACGGCAT GGCCGACGAC
ATCGGACGCA CGCGCCCCAA CCGCGAGCTC GTCGGCCAGG GCGTCGCCAA CGTCGCGTCC
GGCCTGTGCG GCGGCCTGCC CGCGACGGGC GCGATCGCCC GCACCGCCGT CAACGTGCGC
GCCGGCGCGC GCACCCGTGT CGCGGCCCTC ACCCACGCCC TCGTGCTCAC CGCGATCGTG
TACCTGGCCG CACCGCTCGT GGGGCGGATC CCGCTGTCGG CGCTCGCGGG CGTGCTGCTC
GTCACGTCCG CACGCATGGT CGACCTGCCG ACCGCGCGCG CGATCTGCCG GTCGGGGCGC
TCCGGAGCGC TGGTGTTCGG CGTGACGCTC GCCGTGACGG TCGTGTTCGA CCTCGTCATG
GCCGTCGAGG TGGGTGTCGC GGTCGCCGCC GTGCTGGCGC TGCGCGCGGT CGCACGCAGC
AGCGGGCTGC ACCGCGAGCC GCTGGCCGAG GAGTCCGGCG ACCGGCTCAC AGCCGACGAC
GAGCACGCGC TGCTGCACGA GCACATCGCG GTGTACCGCA TCGACGGTGC GCTGTTCTTC
GCCGACGTGC GCCGGTTCCT CGACGAGCTG GCGCTCGTCA GCGACGTCCG CGTCGTCGTG
CTGCGCCTGG GCAACGTGCG CGTGCTCGAC GCGAGCGGCG CGAACGCCCT CGTCGAGATC
GTCACCGACC TGCGCCGCCG CGGGATCGTC GTGCTGCTCA AGGGGCTGCG TCCCGAGCAC
CGCAAGCTCG CCGAGGGCAT CGGGGTGCTC GACGCGCTCG GCGAGCCGCG GCACCTGTTC
GACGACCTGG ACGACGCACT CGAGCACGCC CGCTCGCACG TGCGCCGCGC CCACGCCCCG
ACCAGGATGG GATGA
 
Protein sequence
MTDLAPVVRR IRALAPHRDD YADLPRSWRA DLLAGVTVAV VALPLALGFG VASGLGAAAG 
LVTAVVAGAI AAVLGGSHLQ VSGPTGAMTV VLLPVVARHG AEAVPIVAIM AGGIVILAGV
LGIGRLVAYI PWPVVEGFTC GIGVVIALQQ VPLALDTPRA EGENAGLVAL RTLGDVDWSQ
AVAPLALVAL VVAVMLALPR LRQGLPASLV AVVLATAVAE AARLDVDRIG VLPSALPAPH
LPIVDLATTS ALFSAALAVA ALAALESLLS ARVADGMADD IGRTRPNREL VGQGVANVAS
GLCGGLPATG AIARTAVNVR AGARTRVAAL THALVLTAIV YLAAPLVGRI PLSALAGVLL
VTSARMVDLP TARAICRSGR SGALVFGVTL AVTVVFDLVM AVEVGVAVAA VLALRAVARS
SGLHREPLAE ESGDRLTADD EHALLHEHIA VYRIDGALFF ADVRRFLDEL ALVSDVRVVV
LRLGNVRVLD ASGANALVEI VTDLRRRGIV VLLKGLRPEH RKLAEGIGVL DALGEPRHLF
DDLDDALEHA RSHVRRAHAP TRMG
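The relationship between the gene sequence and the protein sequence can be reproduced with a short script. This is a minimal sketch, not the database's own pipeline: it assumes the sequences are pasted as plain strings (the blocks of ten are joined by stripping whitespace) and uses the standard codon assignments, which translation table 11 shares with table 1 (the two differ in permitted start codons). The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in NCBI order (first, second, third base over T, C, A, G).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate complete codons in frame 1; stop codons are written as '*'."""
    s = clean(seq)
    return "".join(CODON_TABLE[s[i:i + 3]] for i in range(0, len(s) - 2, 3))


if __name__ == "__main__":
    first_30_nt = "ATGACCGACC TCGCCCCCGT GGTCCGGCGC"
    last_15_nt = "ACCAGGATGG GATGA"
    print(translate(first_30_nt))    # MTDLAPVVRR
    print(translate(last_15_nt))     # TRMG*
    print(gc_content("ATGACCGACC"))  # 60.0
```

The first and last codons translate to the start (MTDLAPVVRR) and end (TRMG, followed by the TGA stop) of the protein sequence listed above. Running `gc_content` on the full gene sequence gives the figure to compare against the GC content field.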