Gene Mflv_1003 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMflv_1003 
Symbol 
ID4972330 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium gilvum PYR-GCK 
KingdomBacteria 
Replicon accessionNC_009338 
Strand
Start bp1051873 
End bp1052865 
Gene Length993 bp 
Protein Length330 aa 
Translation table11 
GC content72% 
IMG OID640455200 
Productcellulase 
Protein accessionYP_001132274 
Protein GI145221596 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG5297] Cellobiohydrolase A (1,4-beta-cellobiosidase A) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCTCCG CTGCCCGCGC AGCCGCACGA TGGATCGCCC CCTTCCTGAC GGTTGCCGCC 
GTCGCCGGTT TCGCGGTCGC GGCGCAACCC GCGCATCGCG GCACCGCACC GGAGGTCCGG
CTGGTCAGCG ACGCGAACCC GCTGGTCGGG CGCCCGTTTT ACGTCAACCC GAACTCCAAG
GGCACGCGCG CGGCGCAGAG CAACCCGGAC CCGCTGCTGG CCGCAGTGGT CAACACGCCC
ACCGCGTACT GGATGGACCA CATCTCCACC CCGGCGGTCG ACGCGAAGTA CATCGCGACC
GCTCAGGCCG CCGGCACCAC CCCGGTGCTG GCCCTCTACG GGATCCCGAA CCGCGACTGC
GGCAGCTACG CCGCGGGCGG GTTCGGATCG GCCGGGTCGT ACCGGGCGTG GATCGACGGA
GTCGCCGGCG CCATCGGCGG CGGGCCCGCC GCGGTGATCC TCGAACCGGA CGCGCTCGCG
ATGATCGACT GCCTGTCACC TGGCCGGCAG CAGGAGCGCC TGGAGCTCAT CGGCTACGCG
GTCGACACGC TGACCCGCAA CCCGGCGACC GCGGTGTACG TCGACGCCGG GCATTCGCGC
TGGGTGCCCG CCGACGTGAT GGCGGGCCGG CTCAACCAGG TCGGGATTGC GAAGGCGCGC
GGCTTCAGCC TCAACACCGC GAACTTCTTC ACCACCGAGG AGTCGGTGGG CTACGGCGGC
GCGATCTCCG GGATGACCGG CGGCAAGCCG TTCGTCATCG ACACGTCCCG CAACGGCGCC
GGACCGGTCG AGGGCGACGA CCTCTACTGG TGCAATCCGA GCGGCCGCGC TCTCGGCGTG
CGACCCACCA CCGACACGGG CAACCCGATG GTCGACGCGT TCCTGTGGGT CAAGCGGCCC
GGAGAATCCG ACGGCGCGTG CCGTGGTGCG CCCAGTGCGG GCACCTTCGT CGCCCAGTAC
GCGATCGACC TGGCCCGCAA CGCCGGTTGG TGA
 
Protein sequence
MSSAARAAAR WIAPFLTVAA VAGFAVAAQP AHRGTAPEVR LVSDANPLVG RPFYVNPNSK 
GTRAAQSNPD PLLAAVVNTP TAYWMDHIST PAVDAKYIAT AQAAGTTPVL ALYGIPNRDC
GSYAAGGFGS AGSYRAWIDG VAGAIGGGPA AVILEPDALA MIDCLSPGRQ QERLELIGYA
VDTLTRNPAT AVYVDAGHSR WVPADVMAGR LNQVGIAKAR GFSLNTANFF TTEESVGYGG
AISGMTGGKP FVIDTSRNGA GPVEGDDLYW CNPSGRALGV RPTTDTGNPM VDAFLWVKRP
GESDGACRGA PSAGTFVAQY AIDLARNAGW