Gene Cfla_0053 details


Gene Information

Locus tag: Cfla_0053
Symbol:
ID: 9143918
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cellulomonas flavigena DSM 20109
Kingdom: Bacteria
Replicon accession: NC_014151
Strand:
Start bp: 67515
End bp: 69161
Gene Length: 1647 bp
Protein Length: 548 aa
Translation table: 11
GC content: 75%
IMG OID:
Product: ABC transporter related protein
Protein accession: YP_003635172
Protein GI: 296127922
COG category:
COG ID:
TIGRFAM ID:
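
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record: the function names are hypothetical, and it assumes that Start bp and End bp are inclusive 1-based coordinates and that the last codon of the CDS is a stop codon (the gene sequence on this page ends in TGA).

```python
# Values copied from the Gene Information fields of this record.
START_BP = 67515
END_BP = 69161
GENE_LENGTH_BP = 1647
PROTEIN_LENGTH_AA = 548


def span_length(start: int, end: int) -> int:
    """Feature length, assuming inclusive 1-based start and end coordinates."""
    return end - start + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


computed_gene_length = span_length(START_BP, END_BP)
computed_protein_length = protein_length(computed_gene_length)

# Compare the computed values against the values listed in the record.
print(computed_gene_length == GENE_LENGTH_BP)
print(computed_protein_length == PROTEIN_LENGTH_AA)
```

Under these assumptions both comparisons agree with the listed Gene Length and Protein Length.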


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00894848
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGAGTGCGA CCCTCCAGGC CCGGTCCGTG GCCGCGGCGT TCGGTGACCG CGAGCTGTTC 
TCCGGCCTCG ACCTCGTCGT CGCGCCCGGC GACGTCGTCG GGCTCGTCGG CCCCAACGGC
GCCGGCAAGA CGACGTTGCT GCGCATCCTT GCAGGTCAGC GGGCTCCGGA GGCGGGGGTG
GTGGCGCTGT CACCGCCGAC GGCGCAGGTC GGCTACCTGG TGCAGGAGGT CGAGCGCCGG
GCGGACGAGT CGGTGCGGAC GTTCCTCGAG CGACGGACGG GCGTGGCCGA CGCGCAGGCC
GCGATGGACG CGGCGTCCGA CGCGCTCGCG GCCGACGCGC CGGGGGCGGG CGACGCGTTC
ACGCACGCGC TCGAGCGGTG GATGGCCCTG GGTGGCGCGG ACCTCGACGC GCGGCTCGGC
GGCGTCGCCG ACGACCTCGG GCTCGCCGTC GACCTCGACC TGCCCATGAC GGCGCTGTCC
GGCGGGCAGG CCGCGCGCGT CGGGCTCGCG GCGCTGCTGC TCTCGCGCTT CGACCTGTAC
CTGCTCGACG AGCCGACCAA CGACCTCGAC GCCGACGGGC TCGACCGCCT GGAGGAGTTC
GTCGCGCAGG CGTCGGCGCC GGTCGTGGTC GTCAGCCACG ACCGGGAGTT CCTGGCGCGG
ACCGTCACCA CGGTCGTCGA GATCGACCGC AGCCTGCAGC GTGTCGCCAC CTACGGCGGC
TCCTACGACG CGTACCTCGA GGAGCGCTCG ACGGCCCGCC GGCAGGCGCG CGAGGCGTAC
GAGGACTACG CGGGCCGCCG GGACGCGCTC GCGGCCCGCG CCCGCATGCA GCGCGCGTGG
ATGGAGAAGG GCGTGCGCAA CGCGATGCGC AAGGCCACGG ACGGCGACAA GAACGTCAAG
CAGGGCCGCC GCGAGTCGTC CGAGAAGCAG GCGTCGAAGG CGCGGCAGAC CGACCGCATG
ATCGAGCGGC TCGTGGTCGT CGAGGAGCCG CGCAAGGAGT GGCAGCTGCG CATGAGCATC
GCCACCGCAC CGCGCTCGGG GTCCGTGGTC GCGACCGCGC GGGCCGCCGT CGTCCGCCGC
GGGGACTTCG TCCTCGGCCC GGTGGACCTG CAGCTCGACT GGCAGGACCG CATCGCCGTC
ACCGGGCCCA ACGGCTCGGG CAAGTCGACG CTGCTCGCCC TGCTGCTGGG GCGCCTGGCG
GCCGACGAGG GCACCGCGGC CCTCGGGTCG GGCGTGCTCG TGGGGGAGGT CGACCAGGCA
CGGGCAGCGT TCGAGCGCGA CGAGCCGCTG GGCGACGCGT TCGCGCGCGA GGTCCCGGAG
TGGACGACGG CCGACGTGCG CACGCTGCTG GCGAAGTTCG GCCTCGCCGG CCACCAGGTG
GGCCGCCCGG CGGCGTCGCT GTCCCCGGGG GAGCGCACGC GGGCGGCCCT CGCGCTGCTG
CAGGCCCGCG GCGTCAACCT GCTGGTCCTC GACGAGCCGA CCAACCACCT CGACCTGCCG
GCGATCGAGC AGCTCGAGCA GGCGATGGAG TCCTTCGACG GCACGATCCT GCTGGTCACG
CACGATCGCC GGATGCTCGA CACCGTGCGG CTGACGCGCC GCTGGCACGT CGAGGACGGG
CAGGTCACGG AGGTCCGGCC GGACTGA
 
Protein sequence
MSATLQARSV AAAFGDRELF SGLDLVVAPG DVVGLVGPNG AGKTTLLRIL AGQRAPEAGV 
VALSPPTAQV GYLVQEVERR ADESVRTFLE RRTGVADAQA AMDAASDALA ADAPGAGDAF
THALERWMAL GGADLDARLG GVADDLGLAV DLDLPMTALS GGQAARVGLA ALLLSRFDLY
LLDEPTNDLD ADGLDRLEEF VAQASAPVVV VSHDREFLAR TVTTVVEIDR SLQRVATYGG
SYDAYLEERS TARRQAREAY EDYAGRRDAL AARARMQRAW MEKGVRNAMR KATDGDKNVK
QGRRESSEKQ ASKARQTDRM IERLVVVEEP RKEWQLRMSI ATAPRSGSVV ATARAAVVRR
GDFVLGPVDL QLDWQDRIAV TGPNGSGKST LLALLLGRLA ADEGTAALGS GVLVGEVDQA
RAAFERDEPL GDAFAREVPE WTTADVRTLL AKFGLAGHQV GRPAASLSPG ERTRAALALL
QARGVNLLVL DEPTNHLDLP AIEQLEQAME SFDGTILLVT HDRRMLDTVR LTRRWHVEDG
QVTEVRPD