Gene Cfla_1126 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCfla_1126 
Symbol 
ID9145005 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCellulomonas flavigena DSM 20109 
KingdomBacteria 
Replicon accessionNC_014151 
Strand
Start bp1257458 
End bp1258759 
Gene Length1302 bp 
Protein Length433 aa 
Translation table11 
GC content69% 
IMG OID 
Productextracellular solute-binding protein family 1 
Protein accessionYP_003636229 
Protein GI296128979 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.162135 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000581091 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAGAGGA CTGCGGCACT GCGCGCGGTC GCGCTCGGCG CAGCGGTGGC GATGATCGCG 
ACCGCCTGCA GCTCCGGGAG CGGCAGCGAC GACCCCACCG AGGGTGCTGG CGAGAACGTC
ACCCTCACGT GGTGGCACAA CTCCAACACC GGTGCGGGGA AGGACTACTA CGACCAGCTG
GCCGACGAGT TCGAGTCGGA CAACCCTGGC GTGACGATCG AGGTCAGCGC CCTCCAGCAC
GAGGACATGC TCACCAAGCT CGACGCCGCG TTCCAGACCG GCGACCAGCC GGACATCTTC
ATGGAGCGTG GTGGCGGCGA GCTCAAGGCG CACGTCGCCG CGGGCCTCGT CAAGGACATC
ACCGACGACG CCGCGGACAC GATCTCGGCG CTCGGCGGCT CGGTCAGCGG CTGGACGGTC
GAGGACCGTG TCTACTCGCT GCCCTTCTCC ATGGGTGTTG TCGGCTTCTG GTACAACAAG
TCGATGTTCG CCCAGGCGGG CATCACCGAG GCGCCCAAGA CGATGGACGA CCTGTACGCC
GCGGTCGAGG CGCTCAAGGG CGCCGGCATC GAGCCGATCT CGGTCGGCGC CGGTTCCGCC
TGGCCCGCCG CGCACTACTG GTACTACTTC GCCCTGCGTC AGTGCTCGCA GGACACGATC
GCCACGGCGT CCCAGGAGCT CGAGTTCACG GACCCCTGCT GGGTCAAGGC GGGCGAGAGC
CTGGCCGACC TGGTGGCGCA GGAGCCGTTC AACACCGGCT TCCTCGGCAC CGAGGCCCAG
GGCACGCCCG AGTCGGCCTC CGGCCTCCTC GCGAACCGCA AGGTCGCGAT GGAGCTCGCC
GGCCACTGGG AGCCCGGCGT CATGCAGGGC CTGACGGAGG ACGAGCAGGG CCTCGGCGAG
GACACCGGCT GGTTCCCGTT CCCGGAGGTC GCGGGTGGCG AGGGTGACCC GGCCGCCCAG
CTCGGTGGCG GTGACGCGTG GGCGTGCTCG AACGACGCGC CGGACATCTG CGTCGACTTC
ATCGAGTTCA TGCTGTCGAA CGACGTCCAG AAGGGCTTCG CGGAGCTCGA CATGGGCCTG
CCGACGCTCC CGTCCGCCAC GGCGTTCGTC GCGGCCCCGG AGCTGGCGCA GCTGCTCTCG
TACCGGAACG ACGCTCCGTA CGTCCAGCTG TACTTCGACA CGCAGTTCGG CGAGAACATC
GGTGGCGCCA TGAACGAGGC CATCGTGTCG GTGTTCGCGG GGAGCGGGAC GCCTCAGGGC
ATCGTCGACG CGACCCAGGC CGCGGCTGAC CTCGAGAAGT GA
 
Protein sequence
MKRTAALRAV ALGAAVAMIA TACSSGSGSD DPTEGAGENV TLTWWHNSNT GAGKDYYDQL 
ADEFESDNPG VTIEVSALQH EDMLTKLDAA FQTGDQPDIF MERGGGELKA HVAAGLVKDI
TDDAADTISA LGGSVSGWTV EDRVYSLPFS MGVVGFWYNK SMFAQAGITE APKTMDDLYA
AVEALKGAGI EPISVGAGSA WPAAHYWYYF ALRQCSQDTI ATASQELEFT DPCWVKAGES
LADLVAQEPF NTGFLGTEAQ GTPESASGLL ANRKVAMELA GHWEPGVMQG LTEDEQGLGE
DTGWFPFPEV AGGEGDPAAQ LGGGDAWACS NDAPDICVDF IEFMLSNDVQ KGFAELDMGL
PTLPSATAFV AAPELAQLLS YRNDAPYVQL YFDTQFGENI GGAMNEAIVS VFAGSGTPQG
IVDATQAAAD LEK