Gene Cfla_1994 details

Gene Information

Locus tag: Cfla_1994
Symbol:
ID: 9145889
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cellulomonas flavigena DSM 20109
Kingdom: Bacteria
Replicon accession: NC_014151
Strand:
Start bp: 2220389
End bp: 2222239
Gene Length: 1851 bp
Protein Length: 616 aa
Translation table: 11
GC content: 74%
IMG OID:
Product: ABC transporter related protein
Protein accession: YP_003637088
Protein GI: 296129838
COG category:
COG ID:
TIGRFAM ID:
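
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon (the gene sequence in this record ends in TGA).

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(2220389, 2222239)
print(length)                     # 1851
print(protein_length_aa(length))  # 616
```

Under those assumptions the results agree with the Gene Length and Protein Length fields of this record.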


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.0769116
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGCGA GCACGTCGCC GGCCGCGGCC CGCCAGGACA GCCGGATGGC GCCGGCGAGC 
ACCCTCGGCG TGCTCGCCAC CCTGCGGCGC GCCGCGCAGG TCTCGCCCGA GCTGCTCGAC
GGCCTCACGG TCACGCTCGC CCTGGCGGTC GTCGCGGCGA CCGCTCGCGT GCTCGTCCCG
CTCGCGGTGC AGCAGACGGT CGACACCGCG ATCCTCGCCC CGGGTGGCGT GGACGTACGC
CGTGCGGCGC TGCTCGTCGG GGTCGCCGCG CTCGGCCTGG CGGTGGGTGC GGGATGCTCC
GCGCTGGTCA ACGTGCGTCT CTTCCGGTCC AGCGAGGCGG GACTCCTCAC GCTGCGGAGC
CGGGCGTTCC GGCACGTGCA CGACCTGTCG GTCCTCACGC AGAACAGCGA GCGCCGCGGC
TCGCTGGTCT CGCGCGTCAC CTCGGACGTC GACACGATCT CGATGTTCGT GCAGTGGGGC
GGCATCATGC TGCTCGTCTC CGTGCTGCAG ATCCTCGTGG CCACGACGCT GATGGCGGTC
TACTCGTGGC AGCTCACGCT GCTCGTGTGG GCGTGCTTCC TGCCGCTCCT GGTCGTGCTG
CCACGGCTGC AGCGCGGTGT GAACCGCCGC TACGCGGCCG TGCGCGAGCA GTACGGCGCC
ATGCTCGGTG CGGTCTCGGA GGCCGTCGTG GGGGCCGAGA CGATCCGCGC CTACGGTGTG
GCGGCACGCA CGCAGCGCCG CATCGACGCC GCCGTCGCCG GCACCCGGCG CGCCATGGTG
CGCGCCCAGA ACCTCGTCGC GGTGGTCTTC TCGTCCGGGG TGCTCGTGGC GAACCTCGTG
CTCGCCGTCG TCGTCGTCGC CGGCACGTGG CTGGGGGTGG CGGGCGACCT CACGGTCGGT
CGGGTGCTGG CGTTCCTCTT CCTCGTGCAG CTCTTCACCG GTCCGGTGCA GATGGCCACC
GAGATCCTCA ACGAGCTGCA GAACGCGATC TCCGGCTGGC GCCGCGTGCT CGGCGTCCTC
GAGACCCCCG TGGACGTCCC CGAGCCCGGT GACGACGTGG TGCCGAGCCC GCGCGGCCCG
GCGGCGCTGA CCTTCACCGG AGTGGGCTAC GCCTACCCCG ACGGCCCGCC GGTGCTGCAG
GGCGTCGACC TGCACGTCGC GGCCGGGACG TCGGTCGCCG TGGTGGGGGC GACGGGGTCG
GGCAAGACCA CGCTCGCCAA GCTGGTGGCC AGGTTCATGG ACCCGACCGA GGGTGCGGTG
CTGCTCGACG GGGTCGATCT GCGGCGGATC GCGTCACGTG ACCTGCGCCG GCGCGTCGTG
CTCGTGCCGC AGGAAGGCTT CCTGTTCGAC GGCACCATCG CGGAGAACAT CGCCTACGGG
CTGCGGGACG AGGGTGGCGA CCGACCGTCC ACCGAGGGCT CGGCGCCGCA GGATGCCGAG
CGCGTCGCGC AGGTCGCGGC CGAGCTCGGG CTCGACGTAT GGCTCGCCGA GCTGCCGAGC
GGGCTGGCCA CCCCCGTCGG TCAGCGTGGT GAGCTGCTGT CGGCGGGGGA GCGGCAGCTC
GTGGCGCTGG CGCGCGCCCG GCTCGCGGAC GGTGACCTGC TGCTGCTCGA CGAGGCGACC
TCCGCGGTGG ACCCGGTGGC CGAGGTGCGG ATCGGTCGGG CGCTGCGCGA GCTCGCACGC
GGGCGCACGA CCCTGACGAT CGCGCACCGG CTGTCCACGG CCGAGGCCGC GCACCTCGTG
GTCGTCGTGC ACGCGGGCCG CGTCGTCGAG GTCGGCACGC ACGCGGAGCT GACCGCGCGC
GACGGGCAGT ACGCGCGGAT GCACGCGGCG TGGGTGGCGC AGACCCGCTG A
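
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be joined before any per-base statistic such as GC content can be computed. The following is a minimal sketch with hypothetical helper names; it is not the method the source database used to derive its GC content field, and it is demonstrated only on the first line of the sequence.

```python
def clean_sequence(block_text: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one uppercase string."""
    return "".join(block_text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; returns 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence in this record (six blocks of 10 bases).
first_line = "ATGAGCGCGA GCACGTCGCC GGCCGCGGCC CGCCAGGACA GCCGGATGGC GCCGGCGAGC"
seq = clean_sequence(first_line)
print(len(seq))                    # 60
print(round(gc_fraction(seq), 2))  # GC fraction of the first 60 bases only
```

Passing the full multi-line sequence to `clean_sequence` works the same way, since line breaks are treated as whitespace.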
 
Protein sequence
MSASTSPAAA RQDSRMAPAS TLGVLATLRR AAQVSPELLD GLTVTLALAV VAATARVLVP 
LAVQQTVDTA ILAPGGVDVR RAALLVGVAA LGLAVGAGCS ALVNVRLFRS SEAGLLTLRS
RAFRHVHDLS VLTQNSERRG SLVSRVTSDV DTISMFVQWG GIMLLVSVLQ ILVATTLMAV
YSWQLTLLVW ACFLPLLVVL PRLQRGVNRR YAAVREQYGA MLGAVSEAVV GAETIRAYGV
AARTQRRIDA AVAGTRRAMV RAQNLVAVVF SSGVLVANLV LAVVVVAGTW LGVAGDLTVG
RVLAFLFLVQ LFTGPVQMAT EILNELQNAI SGWRRVLGVL ETPVDVPEPG DDVVPSPRGP
AALTFTGVGY AYPDGPPVLQ GVDLHVAAGT SVAVVGATGS GKTTLAKLVA RFMDPTEGAV
LLDGVDLRRI ASRDLRRRVV LVPQEGFLFD GTIAENIAYG LRDEGGDRPS TEGSAPQDAE
RVAQVAAELG LDVWLAELPS GLATPVGQRG ELLSAGERQL VALARARLAD GDLLLLDEAT
SAVDPVAEVR IGRALRELAR GRTTLTIAHR LSTAEAAHLV VVVHAGRVVE VGTHAELTAR
DGQYARMHAA WVAQTR