Gene Cfla_1369 details

Gene Information

Locus tag: Cfla_1369
Symbol:
ID: 9145253
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cellulomonas flavigena DSM 20109
Kingdom: Bacteria
Replicon accession: NC_014151
Strand:
Start bp: 1518863
End bp: 1520989
Gene Length: 2127 bp
Protein Length: 708 aa
Translation table: 11
GC content: 75%
IMG OID:
Product: 4-alpha-glucanotransferase
Protein accession: YP_003636466
Protein GI: 296129216
COG category:
COG ID:
TIGRFAM ID:
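
The Start bp, End bp, Gene Length and Protein Length fields above are mutually consistent, which the short check below illustrates. It is only a sketch, and it assumes that the coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 1518863
end_bp = 1520989

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")      # 2127 bp
print(protein_length, "aa")   # 708 aa
```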


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.613124
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00100057
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGCCCGAGA CGACCAGACA CGAGGACCTG TGGCGCCTCG CCGCCGCGTA CGACGTCGTG 
CCGCGGTACC AGGGGCACGA CGGGGACGAG CACGAGGCGT CGGACGAGAC GGTCGTGCGC
GTGCTGGCCG CGCTGGGCGT CGACGCGTCG TCACCCGAGC GGGTCGAGCT GGCGCTCGCG
CACGTCGAGA ACATGCCGTG GCGGCGCGTG CTGCCGCCCG TCGTGGTGCT CCGCGAGGGG
GCCGCGGTGC AGGTCCCGGT GCACGTGACG CACGGTGACC CGGTGGACGT GTGGCTCGAG
CTCGACGCCG AGGCGGGCGG CGGGCGGCGC GAGGTCACGC AGGTCGACGT CGTCGTGGAA
CCGCGCACCG TCGACGGCCG GCTCGTGGGC CGCGCGACGT TCGAGCTCCC TACCGACCTG
CCGCTCGGTT GGCACGAGAT CCGCGCGGAC GGCCCCAGTG CGCACGCGCA CAGCCCCGTC
GTCGTCACGC CGCACCGCCT CGAGCTCCCG GAGCGACTGC GCGAGGGCGG CGTCTGGGGG
CTCATGCCGC AGCTGTACTC GGTGCGCTCG CGCCGGTCGT GGGGCGTCGG TGACCTCGCC
GACCTCGCCG AGCTCGGCTG GCTCGCCGCC CACCGGTGGA AGGCCGACTT CCTCCTCGTC
AACCCGCTGC ACGCGGCCGA GCCGGTCGAG CCGCTCACGC CCAGCCCGTA CCTGCCCACC
ACGCGGCGCT TCGTCAACCC GCTGTACGTC CGGGTCGAGG ACGTGCCCGA GACGGCCTAC
CTGTCGGCCG CAGACCGCGC CCTCGTCGAG TGGGCCGCGG AACCCGTCCT GGGTCTCGCG
GCGGAACCCG GGCCGATCGA CCGCGACGTC GCCTGGGCCG CCAAGCGCGC CGCTCTCGAG
GTCGTGCACG CGCACCCGCT CGCGCCGGCG CGGCAGGCGG CCTTCGACGC GTACGTCGAG
GAGGAGGGCG AGGGGCTGCG CGAGTTCGCG CTGTGGTGCG CGCTCGTCGA GCGGCACGGC
CCGCCCGCGG GATGGTCCGA CGAGCTGCAC GACCCGCTGT CCGACGCGGT CGCCGCGGCC
GCCGTCGAGC TGGCCGACCG GGTCGCGTTC TGGTCGTGGC TGCAGTGGGT GGCCGACGAG
CAGCTCGAGC ACGCGCAGCG CGTGGCACGG GACAGCGGCA TGGCGCTGGG GATCATGCAC
GACCTCGCCG TGGGCGTGCA CCCCGAGGGC GCCGACGCGT GGGCGCTGCG GGACGTGCTC
GCGACGGGCG CCTCGGTCGG CGCCCCGCCG GACATGTACA ACCAGCAGGG CCAGGACTGG
TCGCAGCCGC CGTGGCACCC GGACGCGCTC GCGCGCGCGG CGTACCGGCC GTACCGCGAC
ATGCTGCGCA CCGTGCTGCG GCACGCCGGT GCGATCCGCA TCGACCACGT CATCGGGCTG
TTCCGGCTGT GGTGGGTGCC GGTCGGCAAC GGCGCCAAGG ACGGCGCGTA CGTGCGCTAC
GACCACGAGG CGCTCGTCGG CATCCTCGCG CTGGAGGCGC ACCGCGCCGG CGCCGTCGTC
ATCGGCGAGG ACCTCGGCAC CGTCGAGCCG TGGGTCCGCG ACTACCTGGA CGACCGAGGG
ATCCTCGGCA CGTCCGTGCT GTGGTTCGAG CAGGAGCACG ACGGCCGCCC CCGGCCTCCC
GAGTCCTACC GGCACCGCGC GCTCGCGACC GTCACGACGC ACGACCTGCC GCCCACCGCG
GGCTACCTGG CCGGTGAGCA CGTCGCGCTG CGCGACAGGC TGGGCCTGCT CACCGAGCCG
GTCGCCACGG TCCGCGCGCA CGCAGCCGCC GAGCGTGAGC GCATGCTCGA CGCGCTGCGC
GAGCGCGGCC TGCTGGGCCA CGACCCGTCG GAGCGCGAGA TCGTCGAGGC GCTGCACCGC
TGGGTCCGTG CCACGCCCGC CGTGCTGCTC GGCGTCTCGC TCGCCGACGC GGTGGGGGAG
CGGCGCGCGC AGAACCAGCC GGGCACGGAC CAGGAGTACC CGAACTGGAA GGTGCCGCTC
GCCGACGGCA CCGGGCGTCC GGTGCTGCTC GACGACCTGT TCACGCACGC GCGTGCGCAG
TCGCTCGCCG ACGCCATGGG GTCGTGA
 
Protein sequence
MPETTRHEDL WRLAAAYDVV PRYQGHDGDE HEASDETVVR VLAALGVDAS SPERVELALA 
HVENMPWRRV LPPVVVLREG AAVQVPVHVT HGDPVDVWLE LDAEAGGGRR EVTQVDVVVE
PRTVDGRLVG RATFELPTDL PLGWHEIRAD GPSAHAHSPV VVTPHRLELP ERLREGGVWG
LMPQLYSVRS RRSWGVGDLA DLAELGWLAA HRWKADFLLV NPLHAAEPVE PLTPSPYLPT
TRRFVNPLYV RVEDVPETAY LSAADRALVE WAAEPVLGLA AEPGPIDRDV AWAAKRAALE
VVHAHPLAPA RQAAFDAYVE EEGEGLREFA LWCALVERHG PPAGWSDELH DPLSDAVAAA
AVELADRVAF WSWLQWVADE QLEHAQRVAR DSGMALGIMH DLAVGVHPEG ADAWALRDVL
ATGASVGAPP DMYNQQGQDW SQPPWHPDAL ARAAYRPYRD MLRTVLRHAG AIRIDHVIGL
FRLWWVPVGN GAKDGAYVRY DHEALVGILA LEAHRAGAVV IGEDLGTVEP WVRDYLDDRG
ILGTSVLWFE QEHDGRPRPP ESYRHRALAT VTTHDLPPTA GYLAGEHVAL RDRLGLLTEP
VATVRAHAAA ERERMLDALR ERGLLGHDPS EREIVEALHR WVRATPAVLL GVSLADAVGE
RRAQNQPGTD QEYPNWKVPL ADGTGRPVLL DDLFTHARAQ SLADAMGS
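
The gene sequence begins with GTG while the protein sequence begins with M. This is expected under translation table 11, where GTG can serve as an initiator codon and is then read as methionine. The sketch below translates the first block of the gene sequence to show this. It assumes the input starts at the annotated start codon, and the function name `translate_cds` is hypothetical, not taken from any tool behind this record.

```python
from itertools import product

# Codon assignments shared by the standard code and translation table 11,
# listed in the conventional TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate_cds(dna):
    """Translate a CDS, reading the first codon as the initiator Met.

    Assumption: the sequence starts at the annotated start codon.
    Whitespace (such as the 10-base grouping used above) is ignored.
    """
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3
    codons = [dna[i:i + 3] for i in range(0, usable, 3)]
    protein = ["M"] if codons else []
    for codon in codons[1:]:
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
first_block = "GTGCCCGAGA CGACCAGACA CGAGGACCTG"
print(translate_cds(first_block))  # MPETTRHEDL
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to the same function should, under the same assumptions, stop at the terminal TGA codon.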