Gene Mflv_0239 details

Gene Information

Locus tag: Mflv_0239
Symbol:
ID: 4971861
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium gilvum PYR-GCK
Kingdom: Bacteria
Replicon accession: NC_009338
Strand:
Start bp: 225030
End bp: 226163
Gene Length: 1134 bp
Protein Length: 377 aa
Translation table: 11
GC content: 73%
IMG OID: 640454444
Product: glycoside hydrolase family protein
Protein accession: YP_001131522
Protein GI: 145220844
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG4833] Predicted glycosyl hydrolase
TIGRFAM ID:
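
The start, end, gene length, and protein length fields in this record are related by simple arithmetic. The following is a minimal sketch of that relationship in Python, assuming 1-based inclusive coordinates and a single terminal stop codon. The function names are hypothetical and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases; spaces and line breaks are ignored."""
    bases = [b for b in seq.upper() if b.isalpha()]
    if not bases:
        raise ValueError("empty sequence")
    return sum(b in "GC" for b in bases) / len(bases)
```

With the values listed in this record, gene_length(225030, 226163) gives 1134 and protein_length(1134) gives 377, matching the Gene Length and Protein Length fields. gc_fraction can be applied to the gene sequence as displayed, since it skips the spaces between 10-base blocks.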


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATCAGA TGTGGGCCAA CCGCGCCGCC AGCGCCGAAG CCGCCATCAC CACCCGGCAC 
CTGCGTCGGG TGTGGGGACT GCCCGGCACG CAGCTCGGGG TGGTCGCCTG GCCCGCGGCG
ACCGGCCACC GCCGGTTCGG CACGTGGCAC TACTGGTGGC AGGCACATCT GCTCGACAAC
CTCGTCGACG CCCAGCTGCG GGACCCTGCG CCCGAGCGCC TCACCCGGAT CAACCGACAG
ATCCGGGGTC AGCGGATCCG CAACATCGGC AAGTGGACCA ACAGCTACTA CGACGACATG
GCCTGGCTGG CGCTGGCACT CGAACGCGCG CAGCGGCTCA CCGGCGTCGG CCGGCCCAAG
GCGCTGGACA CGCTGGCCGA GCAGTTCCTC ACGGCGTGGG TGCCCGAGGA CGGCGGCGGA
ATCCCGTGGC GCAAGCAGGA CCAGTTCTTC AACGCGCCCG CCAACGGGCC CGCGGCGATC
TTCCTGGCCC GGCACCTGGC CGCGCAGGGC GAGAGCCTGC GCCGGGCCCA GCAGATGGCC
GACTGGATCG ACGAGACGCT GCTCGACCCG CAGACCCATC TGATCTTCGA CGGCATCAAG
GGCGGCTCGC TGGTCCGCGC GCAGTACACC TACTGCCAGG GTGTGGTGCT GGGCGTCGAG
ACCGAGCTCG CCGCGCGCAC GTCCGACTCC CGCCACGCCG AACGGGTGCA CCGGCTGGTG
GCCGCCGTCG CCGAACACAT GGCGCCGGGC GGCGTCATCA ACGGCGCCGG CGGCGGTGAC
GGCGGGCTGT TCAACGGCAT CCTCGCGCGC TACCTGGCGC TGGTCGCGAC CGCGCTGCCC
GGCACCTCGC CGGCCGACGA GCAGGCCCGC CGGACCGCGA CGGAGCTGGT GCTCGCCTCG
GGCCGGGCGG CCTGGGACAA CCGCCAGACC GGCAAGGACA CCGAGGATCT CCCCCTGTTC
GGCGCGTTCT GGGACCGGCC CGCGCAGGTG CCCACCGCGG GGACGGCGGA CGCCCGCAGC
GTCGACGGCG CGGTCAACTC CTCGGAGATC CCGGAGCGGG ACCTGTCGGT CCAGTTGGCG
GGATGGATGC TCATGGAGGC CGCCTGCGTT GTGACCGCAG GTGACAGTGA CTGA
 
Protein sequence
MDQMWANRAA SAEAAITTRH LRRVWGLPGT QLGVVAWPAA TGHRRFGTWH YWWQAHLLDN 
LVDAQLRDPA PERLTRINRQ IRGQRIRNIG KWTNSYYDDM AWLALALERA QRLTGVGRPK
ALDTLAEQFL TAWVPEDGGG IPWRKQDQFF NAPANGPAAI FLARHLAAQG ESLRRAQQMA
DWIDETLLDP QTHLIFDGIK GGSLVRAQYT YCQGVVLGVE TELAARTSDS RHAERVHRLV
AAVAEHMAPG GVINGAGGGD GGLFNGILAR YLALVATALP GTSPADEQAR RTATELVLAS
GRAAWDNRQT GKDTEDLPLF GAFWDRPAQV PTAGTADARS VDGAVNSSEI PERDLSVQLA
GWMLMEAACV VTAGDSD