Gene Cfla_3404 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCfla_3404 
Symbol 
ID9147320 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCellulomonas flavigena DSM 20109 
KingdomBacteria 
Replicon accessionNC_014151 
Strand
Start bp3788287 
End bp3789381 
Gene Length1095 bp 
Protein Length364 aa 
Translation table11 
GC content72% 
IMG OID 
Productputative sugar uptake ABC transporter periplasmic solute-binding protein precursor 
Protein accessionYP_003638480 
Protein GI296131230 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.538992 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCACCA GATCCCTCGC CCTCGTGGCA CTGACCGCCG CCGGCGCCCT CGCCCTCGGC 
GGGTGCGGCG GCGGCCGCGA AGGAACAGCG GATGCCGCAG CCGCCGGTGC GGGCGACGGC
TTCGCCGACG ACGCCGTCAT CGGGGTGTCC CTGCCGTGGC TCGGTACCCA GAACTGGGCC
GAGGCGCAGG AGATGTTCAC CACCCGGCTG ACGGAGGCCG GCTTCGAGCC GCTCGTCCAG
GCGGCGGACA ACAAGGTGAC CCAGCAGCAG CAGCAGATCG AGGCGATGAT CGAGCGCGGC
GCCGAGGTGA TCGTCGTCGG CCCCGTCGAC GGCACCCAGC TCGGCAGCGT GCTCGAGCGC
GCCGCCGCGG AGGGCATCGC GGTGATCGGG TACGACCGGC TCATCGAGAA CACGCCGGCC
GTCGACGCGG TCGTGCAGTT CGGCAGCCTG CGCACCGGCG AGCTGCAGGG GCAGTCGCTC
CTCGACGGGC TCGCGGCACG CAAGGGCGAG CCGCCGTACC ACGTCGAGCT GTTCGGAGGC
GGTCCCGCGG ACCCGAACGC CCCGGCGTTC TTCGAGGGCG CCATGTCCGT CCTGCAGCCG
AAGATCGACG ACGGCACCCT GGTCGTCGGG TCGGGCCAGA CCGAGTTCAC GCAGGCCGCG
ACACCCGACT GGGACAACGC CAAGGCCCAG GCACGCATGG ACTCCCTGCT GTCGGGCTTC
TACAGCGCCG AGGAGATCGA CGGTGTGCTG TCGCCGAACG ACGGCATCGC GCGCGCTGTC
ATGACGTCGG CGCAGCAGGC GGGCCAGGAG ACACCCGTGG TAACCGGCCT CGACGCCGAG
AACGAGTCGG TCGTGTCGGT GTGGCAGGGG CAGCAGTGGT CGACGGTCGC CAAGCCGACC
GTCGAGCTGG TCGGCCGCAC GGTCGAGCTG ATCCAGTCCC TCCAGCAGGG CGAGGCGCTG
CCGGAGCCGG ACGAGGAGGT CGACAACGGC CAGACGGACG TCGCCGTGTA CCTGCTCGAC
CCCCTCGTGG TGACGCAGGA GAACGCCCAG GAGGTCTTCG CCGACGACCC CAACCGCCTG
CAGCTGCTGC AGTAG
 
Protein sequence
MRTRSLALVA LTAAGALALG GCGGGREGTA DAAAAGAGDG FADDAVIGVS LPWLGTQNWA 
EAQEMFTTRL TEAGFEPLVQ AADNKVTQQQ QQIEAMIERG AEVIVVGPVD GTQLGSVLER
AAAEGIAVIG YDRLIENTPA VDAVVQFGSL RTGELQGQSL LDGLAARKGE PPYHVELFGG
GPADPNAPAF FEGAMSVLQP KIDDGTLVVG SGQTEFTQAA TPDWDNAKAQ ARMDSLLSGF
YSAEEIDGVL SPNDGIARAV MTSAQQAGQE TPVVTGLDAE NESVVSVWQG QQWSTVAKPT
VELVGRTVEL IQSLQQGEAL PEPDEEVDNG QTDVAVYLLD PLVVTQENAQ EVFADDPNRL
QLLQ