Gene Mvan_4223 details


Gene Information

Locus tag: Mvan_4223
Symbol:
ID: 4645908
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium vanbaalenii PYR-1
Kingdom: Bacteria
Replicon accession: NC_008726
Strand:
Start bp: 4529614
End bp: 4530684
Gene Length: 1071 bp
Protein Length: 356 aa
Translation table: 11
GC content: 69%
IMG OID: 639807690
Product: hypothetical protein
Protein accession: YP_955006
Protein GI: 120405177
COG category: [R] General function prediction only
COG ID: [COG3173] Predicted aminoglycoside phosphotransferase
TIGRFAM ID:
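
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the gene record.
start_bp = 4529614
end_bp = 4530684

# Assumption: coordinates are 1-based and inclusive on the replicon.
gene_length = end_bp - start_bp + 1

# Assumption: one codon per residue, with the terminal stop codon excluded.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1071
print(protein_length)  # 356
```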


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACCGCCT CTGAGCTCTC GATCCCGCAG GACTGGGACG AGATCACTCC CGCCTGGATG 
ACCTCGGCTC TCGCGCAACA CTTTCCGGGA GCTGAGGTCG GCCACGTCCG GGTGGCACTT
CGTGACGACG GCACCAACCG CAGGGCCAGA CTCGCGCTGG ACTACTCCGT GGGGTCGGGT
CCGGCGACGG TCTTCGCCAA GGCCGTCGAC CCCGCCCACG CCGACCTGGT GGAGTTGACC
AGCGGGCTCT ACCACGAGCC GAGGCTCTTC TCCTGCGGTG CGAGGTTGCC GCTGGACCAT
CCGGAGGTCT ACCTCGCGGC CATCGACGAG GAACGCCGCG ACTTCCTGAT GATCATGGAG
GATGTGGCGT CGCGCGGGGC CGACCCGCGC GACTCCACGA GACCCATGAC GGTGGAGCAG
GCCGCCGCGG GAGTGCGTGG GTTGGCGCGG TTGCACGGCG AGTACTGGGG TGAGCGGCTC
ACCGGCGACC CTGCACTGAA TTGGGTGGAG CCGTTCGTGG CGTTCGAGGG ACTCGAGTAC
GCCCCGCTGC ACATCGCCCA CGAACGGCTC GGCGACACGG TGCCCGCCGA GGTCCTCGAC
CTCAGCGGCA CCGACCTGTT CGTCGACATC TGGGCCCGCT ACATCGGCTC GCTGACCCGG
TCGGTACCGA CGCTGCTGCA CGGCGATCCG CACATCGGCA ACACCTACGT GCTGCCGGAC
GGCACCGTGG GATTCCTCGA CTGGCAGATG GTGCGCCGGG GCAGCTTCTC GTTGGACCTG
GGCTACTTCC TGCAGGGTTC GCTGACCACC GAGGACCGCA GGCGGGCCGA ACACGATCTG
CTCGACGAGT ACCGCGCCGC GCTGCGTCTG CCCGCGCAGG AACTCCCGAC GCGGGAAGAC
ATCTGGCTGG GCTATCGCGC CTCGGTCGCA CACGGCCTGG CGATCTGGAT GGCCACCCTC
TCCGGTGGCG ACGCCTGGCA GCGTGCCGAC ATATGCCTGG CGTTCGCGCA GCGCTATGCC
GCCGCGTTCG TCGACCTGGA TACCCGGGAG GCGCTGGACG CGATCACCTA G
 
Protein sequence
MTASELSIPQ DWDEITPAWM TSALAQHFPG AEVGHVRVAL RDDGTNRRAR LALDYSVGSG 
PATVFAKAVD PAHADLVELT SGLYHEPRLF SCGARLPLDH PEVYLAAIDE ERRDFLMIME
DVASRGADPR DSTRPMTVEQ AAAGVRGLAR LHGEYWGERL TGDPALNWVE PFVAFEGLEY
APLHIAHERL GDTVPAEVLD LSGTDLFVDI WARYIGSLTR SVPTLLHGDP HIGNTYVLPD
GTVGFLDWQM VRRGSFSLDL GYFLQGSLTT EDRRRAEHDL LDEYRAALRL PAQELPTRED
IWLGYRASVA HGLAIWMATL SGGDAWQRAD ICLAFAQRYA AAFVDLDTRE ALDAIT
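
The sequences above are printed in blocks of ten characters, so they need to be joined before any downstream use. The following is a minimal sketch of how a block could be cleaned and its GC percentage recomputed for comparison with the GC content field in the record. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(block: str) -> str:
    """Join a whitespace-separated sequence block into one uppercase string."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence from the record, used as a small example.
first_line = "GTGACCGCCT CTGAGCTCTC GATCCCGCAG GACTGGGACG AGATCACTCC CGCCTGGATG"
seq = clean_sequence(first_line)

print(len(seq))                    # 60 bases on a full line
print(round(gc_percent(seq), 1))   # GC percentage of this line only
```

Passing the full gene sequence through the same two functions would give a value to compare against the listed GC content.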