Gene Cfla_3201 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCfla_3201 
Symbol 
ID9147117 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCellulomonas flavigena DSM 20109 
KingdomBacteria 
Replicon accessionNC_014151 
Strand
Start bp3562721 
End bp3565093 
Gene Length2373 bp 
Protein Length790 aa 
Translation table11 
GC content71% 
IMG OID 
ProductABC transporter related protein 
Protein accessionYP_003638282 
Protein GI296131032 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0977104 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000407645 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACCGCGA CGCCCGCCCC GCAGCCCGCC CGCGCCGACA GCCACGACCT CATCCGCGTC 
CACGGCGCGC GCGAGAACAA CCTGCAGGAC GTCAGCGTCG ACCTGCCCAA GCGACGGCTG
ACGGTGTTCA CGGGCGTCTC CGGCTCGGGC AAGAGCTCGC TGGTCTTCGG CACCGTCGCC
GCGGAGTCCC AGCGGCTCAT CAACGAGACC TACAGCGCGT TCGTCCAGGG CTTCATGCCG
ACCCTGGCGC GACCCGACGT CGACGTCCTG GAGGGCCTGA CCACCGCGAT CATCGTCGAC
CAGGAGCGCC TGGGCGCCAA CTCGCGCTCG ACCGTCGGCA CCGTGACCGA CGCCAACGCG
ATGCTGCGCA TCCTGTTCAG CCGGCTCGGC GACCCGCACA TCGGCTCGCC GCAGGCCTAC
TCCTTCAACG TGCCGACCGT CCGGGCGTCG GGCGCGATCG AGGTGCAGCA GGGGCACAAG
CGCAAGGAGA GGGTGTCGTT CGAGCAGATC GGCGGCATGT GCGCGCGCTG CGAGGGCATG
GGCAAGGTCA ACGACGTCGA CCTCACCGCC CTCTACGACG ACGCGAAGTC GCTCGACGAG
GGCGCGCTGA CCGTCCCCGG CTACACGATG GACGGCTGGT ACGGGCGCAT CTTCCGCGCG
AGCGGGCTCT TCCCGCCCGA CCGCCCGATC CGCGAGTTCA CCGAGAAGCA GCTGCACGGC
CTGCTCTACC AGGAGCCGAC CAAGATCAAG GTCGAGGGCA TCAACGTCAC GTACGAGGGG
ATCATCCCCA AGATCCAGAA GTCGTTCCTC GCCAAGGACG TCGACGCGAT GCAGCCGCAC
ATCCGCGCGT TCGTCGAGCG GGCGTTCACC TTCCGCACCT GCCCGGAGTG CGACGGCACG
CGTCTCAACG AGGGCGCGCG GTCCTCCCGC ATCCGGGGGC GGAGCATCGC CGACCTGTGC
GCCATGCAGA TCACCGACCT CGCCGCGTGG GTCCGTGACC TCGACGAGCC GGGCGTCGCC
CCGCTGCTCG GCGCCCTGCG CGACACGCTC GACTCGTTCG TCGAGATCGG GCTCGGGTAC
CTCTCGCTCG ACCGGCCCTC CGGCACGCTG TCCGGCGGCG AGGCGCAGCG GACCAAGATG
ATCCGGCACC TCGGGTCGTC CCTCACCGAC GTCACCTACG TCTTCGACGA GCCGACCGTC
GGGCTGCACC CGCACGACAT CGCGCGCATG AACGACCTGC TGCTGCGCCT GCGGGACAAG
GGCAACACCG TGCTCGTCGT CGAGCACAAG CCCGAGGCCA TCGCGATCGC CGACCACGTC
GTCGACCTGG GCCCGGGTGC CGGCACCGAG GGCGGGCACG TGGTCTTCGA GGGCACGGTC
GAGGGCCTGC GGGCCAGCGG CACGCTGACC GGCCGTCATC TCGACGACCG CGCGACCCTC
AAGCCGTCGG TCCGCACGCC CACGGGCGCG CTGGAGGTGC GCGGCGCCAC CACGCACAAC
CTCCAGGGCG TCGACGTCGA CGTGCCGCTC GGCGTGCTCG TCGTCGTCAC CGGTGTCGCC
GGGTCCGGCA AGAGCTCGCT GATCCACGGC TCGGTCGCGG GGCGCGACGG CGTGGTCGCG
ATCGACCAGG GCGCGATCCG TGGCTCGCGC CGCTCGAACC CGGCGACGTA CACGGGCCTG
CTCGAGCCGA TCCGCAAGGC CTTCGCCAAG GCCAACGGGG TCAAGCCTGC GCTGTTCAGC
GCGAACTCCG AGGGCGCGTG CCCGGTCTGC AACGGCGCCG GCGTGGTCTA CACGGACCTC
GGCATGATGG CCGGCGTCTC CACGACGTGC GAGGTCTGCG AGGGCCGCCG CTTCCAGGCG
GAGGTGCTCG AGTACAGGCT GGGCGGGCGG GACATCAGCG AGGTCCTCGC GATGCCCGCG
ACCGTCGCCG AGGAGTTCTT CGCCGCCGGT GAGTCCCGGA TCCCCGCGAC GCACGCGATC
CTGCAGCGGC TCGTCGACGT GGGTCTCGGG TACGTCACGC TCGGCCAGCC GCTCACCACG
CTGTCCGGCG GTGAGCGGCA GCGTCTCAAG CTCGCGACGC ACCTGGGGGA CAAGGGCGGG
GTGTACGTGC TCGACGAGCC GACGACCGGT CTGCACCTCG CGGACGTCGA GAACCTGCTG
GGCCTGCTGG ACCGGCTCGT CGACGCGGGC AAGTCGGTCG TCGTCATCGA GCACCACCAG
GCGGTGATGG CGCACGCGGA CTGGATCATC GACCTCGGCC CCGGTGCCGG GCACGGCGGC
GGGCGCGTGG TGTTCGAGGG CACACCCGCG GACCTCGCCG CGGCGCGCTC GACGCTGACG
GGCGAGCACC TGGCGCAGTA CGTTGGCGCC TGA
 
Protein sequence
MTATPAPQPA RADSHDLIRV HGARENNLQD VSVDLPKRRL TVFTGVSGSG KSSLVFGTVA 
AESQRLINET YSAFVQGFMP TLARPDVDVL EGLTTAIIVD QERLGANSRS TVGTVTDANA
MLRILFSRLG DPHIGSPQAY SFNVPTVRAS GAIEVQQGHK RKERVSFEQI GGMCARCEGM
GKVNDVDLTA LYDDAKSLDE GALTVPGYTM DGWYGRIFRA SGLFPPDRPI REFTEKQLHG
LLYQEPTKIK VEGINVTYEG IIPKIQKSFL AKDVDAMQPH IRAFVERAFT FRTCPECDGT
RLNEGARSSR IRGRSIADLC AMQITDLAAW VRDLDEPGVA PLLGALRDTL DSFVEIGLGY
LSLDRPSGTL SGGEAQRTKM IRHLGSSLTD VTYVFDEPTV GLHPHDIARM NDLLLRLRDK
GNTVLVVEHK PEAIAIADHV VDLGPGAGTE GGHVVFEGTV EGLRASGTLT GRHLDDRATL
KPSVRTPTGA LEVRGATTHN LQGVDVDVPL GVLVVVTGVA GSGKSSLIHG SVAGRDGVVA
IDQGAIRGSR RSNPATYTGL LEPIRKAFAK ANGVKPALFS ANSEGACPVC NGAGVVYTDL
GMMAGVSTTC EVCEGRRFQA EVLEYRLGGR DISEVLAMPA TVAEEFFAAG ESRIPATHAI
LQRLVDVGLG YVTLGQPLTT LSGGERQRLK LATHLGDKGG VYVLDEPTTG LHLADVENLL
GLLDRLVDAG KSVVVIEHHQ AVMAHADWII DLGPGAGHGG GRVVFEGTPA DLAAARSTLT
GEHLAQYVGA