Gene Cfla_2049 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCfla_2049 
Symbol 
ID9145945 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCellulomonas flavigena DSM 20109 
KingdomBacteria 
Replicon accessionNC_014151 
Strand
Start bp2286576 
End bp2288567 
Gene Length1992 bp 
Protein Length663 aa 
Translation table11 
GC content71% 
IMG OID 
ProductABC transporter related protein 
Protein accessionYP_003637143 
Protein GI296129893 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.146206 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGCCA CGACGAGGAC CGACCGTCCC GCCGGCCCGC CGCCCGGCGG GCCGCGTGGC 
GGGCCCATGG GCATGGGTCT GGGGATGCCC GGTCAGAAGT CGATGGACTT CCGTGGCTCG
TTGCGGCGCC TGCTCACGGT GCTGGCGCCC GAGCGTGTCC GGCTCGTCGC CGTACTCGTG
CTGGGGGCGT TGTCGGTGGC GGCGGCGGTT GCCGGCCCCA AGCTGCTGGG CAACGCGACC
GACGTGCTGT TCGACGGTGT GGTCTCGCGG CAGCTCGCGC AGCTCCTGCC GGCCGGCAGC
ACGCAGCAGG AGGCGGTCGA CGCGCTGCGC GCCGCGGGGC AGGGCACGGT GGCAGACATG
GTCGCGGGCA TGCCCGGCCT GACCGTCGGC GACGGCGTCG ACTTCGAGCG GCTCGGCTCG
ATCCTGCTGC TGGTGCTCGG CGTGTACGTC GCCGCGTTCG TGTTCGGCTG GCTGCAAGGG
CGGCTGACGG CGCGCGCGGT GCAGAACACG GTCCTGCGCA TGCGGACGCA GGTCGAGGAG
AAGCTCACGC GCGTCCCGCT GTCGTACTTC GACAAGCAGC CGCGCGGTGA GCTGTTGTCG
CGTGTCACCA ACGACATCGA CAACGTCGCG CAGACCGTGC AGCAGACGCT CTCGCAGCTC
ATCACCTCGG TGCTGACGGT CGTCGGCGTG CTCGCGATGA TGTTCTGGAT CTCGCCGCTT
CTCGCGGTCG TCGCCCTGGT GACGGTCCCG CTGTCGGTCG TGGTCGCGGC CGCGATCGCC
AAGCGCTCGC AACCGCAGTT CGTCGAGCAG TGGGCGTGGA CCGGCAAGCT CAACGCCCAC
ATCGAGGAGA TGTTCACGGG CCACGCGCTG GTCACCGTCT TCGGCCGGCA GCAGGAGGCC
GCCGCGACGT TCGCCGAGCG CAACGGCAAG CTCTACGAGT CCGCGTTCCG GGCGCAGTTC
ATCTCCGGAA TCATCCAGCC GGCGCTGGGG TTCATCGCCA ACCTCAACTA CCTCGTCGTC
GCCGTGGTCG GTGGCCTGCG GGTCGCGTCG GGCACGATGT CGCTCGGCGA CGTGCAGGCG
TTCATCCAGT ACTCGCGGCA GTTCACGCAG CCGATCACGC AGATCGCGTC GATGGCGAAC
CTGCTGCAGT CCGGTGTCGC GTCCGCCGAG CGCGTGTTCG AGCTGCTGGA CGCGCAGGAG
CAGACGCCCG ACCCCGCGCA GCCCGCGACG CTGCCGGAAC GCGTGCGCGG CCGCGTCGCG
TTCGAGGACG TGTCGTTCCG CTACGACGCG GACACGCCGC TCATCGAGAA CCTGTCGGTC
GTCGCGGAGC CCGGGCAGAC CGTCGCGATC GTCGGGCCCA CGGGCGCCGG CAAGACCACT
CTCGTCAACC TCGTCATGCG GTTCTACGAG GTCGACTCCG GGCGCATCAC GCTCGACGGT
GTCGACACGC GGGACGTCAC GCGCGACGCG CTGCGGTCGC AGATCGGCAT GGTCCTGCAG
GACACGTGGC TGTACGAAGG GACGATCGCG GAGAACATCG CGTACGGCGT GGACTCCGCG
ACGCACGAGC AGGTCGTCGA GGCCGCCGTC GCGACCCACG TCGACCGGTT CGTGCGCACC
CTGCCCGACG GGTACGACAC CGTGCTCGAC GACGAGGGCG GCGCGGTGTC CGCCGGCGAG
AAGCAGCTGC TCACCATCGC GCGCGCGTTC CTCGCCGACC CGGCGATCCT CATCCTCGAC
GAGGCGACGT CGTCGGTCGA CACGCGCACC GAGGTGCTCG TGCAGCACGC GATGAACGCC
TTGCGCGCCG GGCGCACGTC GTTCGTCATC GCGCACCGGC TGTCCACGAT CCGCGACGCC
GACGTCATCC TCGTCATGGA GCACGGGAGG ATCGTCGAGC AGGGCACGCA CGACGACCTC
GTCGCGGCCG ACGGTGCGTA CGCGCAGCTG TACCGCAGCC AGTTCGCCGA GGCGGCCGCC
CCGGTCGACT GA
 
Protein sequence
MSATTRTDRP AGPPPGGPRG GPMGMGLGMP GQKSMDFRGS LRRLLTVLAP ERVRLVAVLV 
LGALSVAAAV AGPKLLGNAT DVLFDGVVSR QLAQLLPAGS TQQEAVDALR AAGQGTVADM
VAGMPGLTVG DGVDFERLGS ILLLVLGVYV AAFVFGWLQG RLTARAVQNT VLRMRTQVEE
KLTRVPLSYF DKQPRGELLS RVTNDIDNVA QTVQQTLSQL ITSVLTVVGV LAMMFWISPL
LAVVALVTVP LSVVVAAAIA KRSQPQFVEQ WAWTGKLNAH IEEMFTGHAL VTVFGRQQEA
AATFAERNGK LYESAFRAQF ISGIIQPALG FIANLNYLVV AVVGGLRVAS GTMSLGDVQA
FIQYSRQFTQ PITQIASMAN LLQSGVASAE RVFELLDAQE QTPDPAQPAT LPERVRGRVA
FEDVSFRYDA DTPLIENLSV VAEPGQTVAI VGPTGAGKTT LVNLVMRFYE VDSGRITLDG
VDTRDVTRDA LRSQIGMVLQ DTWLYEGTIA ENIAYGVDSA THEQVVEAAV ATHVDRFVRT
LPDGYDTVLD DEGGAVSAGE KQLLTIARAF LADPAILILD EATSSVDTRT EVLVQHAMNA
LRAGRTSFVI AHRLSTIRDA DVILVMEHGR IVEQGTHDDL VAADGAYAQL YRSQFAEAAA
PVD