Gene Cfla_0068 details


Gene Information

Locus tag: Cfla_0068
Symbol:
ID: 9143933
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cellulomonas flavigena DSM 20109
Kingdom: Bacteria
Replicon accession: NC_014151
Strand:
Start bp: 86232
End bp: 87386
Gene Length: 1155 bp
Protein Length: 384 aa
Translation table: 11
GC content: 74%
IMG OID:
Product: extracellular repeat protein, HAF family
Protein accession: YP_003635187
Protein GI: 296127937
COG category:
COG ID:
TIGRFAM ID:
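
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the reported protein length excludes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 86232
end_bp = 87386

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the final codon is assumed to be a stop codon
# that does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, codon_count, protein_length)
```

Under these assumptions the computed values agree with the reported Gene Length (1155 bp) and Protein Length (384 aa).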


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00675708
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGCTCACCA GAACTGCGCC GAGGCTGCTG CTCGTCGGCG TGCTCGCCTG CACGCTCGCG 
GCGGTGCCCG TCACCGCGGA TCCACCCGCC GCGGCCCGCG CGGGCTCGGG GATCACCGTC
CGTGACCTGG GGACCCTCCC CGGTGACAGC TGGAGCGTCG CGGTGGACGT CAACGAGCAC
GGCCAGGTCA TCGGGTTCAG CCAGGCGGGG AACGGTGACA CGCGCGGGTT CCTCTGGGAC
CGCGGGGTGA TGCGTGACCT GGGGACGTTC TCCGCGGTGG CGATCAACGA CAGCGGGCAG
GTCGTGGGGA CGGCCCTCAA CGGCTCCGGC GCCCAGCAGG CGGCCATGTG GGACGGTGGC
CGGCTCCGGT ACCTCACCAC CCCGGACGGC GCCCCCAGCC GGGCGGTCCT CCTCAACGAG
CACGGCCAGG TCGTCGTGCA GACACGGGAG TACGGCGGCG ACGAGCCGGA CCGGCTGCGC
AACTACGTCT GGGACGACGG GGCCGTGACC GAGATCCCGC CCCTGCCGGG CAGCCCCTAC
ATGCACCCGT TGGACATCAA CGACCAGGGC TGGGTGACCG GCTACAGCCC CGGCCCCGGG
TCCCTCCGCC ACGGGTTCCT GTGGCGGGAC GGCGTCGTCA CCGACCTCGG CTCGGCCGCG
TCCGGGGACG TCGCCTCCAC GATGGGCCTG GCCGTGAACG AGGCCGGTCA GGTGGCGGGC
CAGGCGAGCG CATCGGACAC CGAGCACGCC GCCGCAGTCT GGCAGGACGG CGAGTGGATG
CGGCTCGGCC ACCGCGAGGG CTGGAGCGGC GCGACCGACA TCAACGAGCA CGGCACCGTC
GTGGGGTGGG CGAGCGACGG CGGCCCGCAC GAGCACGCCG TGCTCTACCG CGACGGCGAG
TGGACCGACC TCGCCCCGGC CGGCTCGCGT GCGATCGAGC TGAACGACCG GGACCAGGTC
ATCGGCTCCG TCGACGGCTA CACCCTCACC GCCGTGCTGT GGCAGGACGG CGAGACGCAC
CTCCTGCCGC CCCTCTACCC CGGCAGCGCG ACCACGGCGT ACGACATCAG CGAGCGCGGC
CAGGTCGCCG GCTCCGCCCG CGTGCTGTCC GGCGTGGAGC ACGCCGTGCT CTGGACGACG
GCGCGGGGCT CCTGA
 
Protein sequence
MLTRTAPRLL LVGVLACTLA AVPVTADPPA AARAGSGITV RDLGTLPGDS WSVAVDVNEH 
GQVIGFSQAG NGDTRGFLWD RGVMRDLGTF SAVAINDSGQ VVGTALNGSG AQQAAMWDGG
RLRYLTTPDG APSRAVLLNE HGQVVVQTRE YGGDEPDRLR NYVWDDGAVT EIPPLPGSPY
MHPLDINDQG WVTGYSPGPG SLRHGFLWRD GVVTDLGSAA SGDVASTMGL AVNEAGQVAG
QASASDTEHA AAVWQDGEWM RLGHREGWSG ATDINEHGTV VGWASDGGPH EHAVLYRDGE
WTDLAPAGSR AIELNDRDQV IGSVDGYTLT AVLWQDGETH LLPPLYPGSA TTAYDISERG
QVAGSARVLS GVEHAVLWTT ARGS
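
As an illustration of how the gene and protein sequences relate, the following sketch translates the first 60 and the last 15 nucleotides of the gene sequence. It assumes that translation table 11 assigns the same amino acids as the standard genetic code (the two differ in permitted start codons), and it uses a hand-built codon table rather than any particular library. The helper names `translate` and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon table in TCAG order; amino acid assignments assumed identical
# to the standard code for translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of translation
            break
        protein.append(aa)
    return "".join(protein)

def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (spaces ignored)."""
    dna = dna.replace(" ", "").upper()
    return (dna.count("G") + dna.count("C")) / len(dna)

# First line of the gene sequence above (60 nt, starts in frame at ATG).
first_60 = "ATGCTCACCA GAACTGCGCC GAGGCTGCTG CTCGTCGGCG TGCTCGCCTG CACGCTCGCG"
# Last 15 nt of the gene sequence; in frame because 1155 - 15 is divisible by 3.
last_15 = "GCGCGGGGCT CCTGA"

n_terminus = translate(first_60)
c_terminus = translate(last_15)
print(n_terminus, c_terminus)
```

The two translated fragments correspond to the first 20 and last 4 residues of the protein sequence listed above. `gc_fraction` applied to a short fragment will generally differ from the 74% reported for the full gene.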