Gene Mvan_4610 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_4610 
Symbol 
ID4646627 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp4950420 
End bp4953353 
Gene Length2934 bp 
Protein Length977 aa 
Translation table11 
GC content69% 
IMG OID639808080 
Producthypothetical protein 
Protein accessionYP_955391 
Protein GI120405562 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG0160] 4-aminobutyrate aminotransferase and related aminotransferases
[COG2334] Putative homoserine kinase type II (protein kinase fold) 
TIGRFAM ID[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.102283 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCACCC GATCCACCGC GGGCTTCAAC TTCCTGGAAC AGCAGGAGCT GCCCGCCCCG 
CAGGTGAGCG AGGCCCAGGC GCAGGACATC CTGGCCACAC ACTACGGTCT GGCCGCGCAT
GTGAGCGCGC TGGGAAGCCA GCAGGACAAG AACTTCACGG TCCGCGACGA CGGCGGCGCG
GTGGTCGGGG TGCTCAAGAT CGCCAACCCG GCGTTCACCG CCGAGGAATT GGCCGCCCAG
GATGCGGCGG CCCGGCGGAT CGCCGAGGCC GAACCCGGTC TGCGGGTTGC GGTGCCGCTG
GCCAACGCCG CAGGCGAGAC ACGCACCGCG GTCGACGGTG TGCTCGAGGG CACCGCCCTC
GTGCGACTGC TGCAATTCCT GCCGGGCGGC ACCGTGTCGG AGTCCGGCTA CCTGACCCCG
GACTCGGTGG CCGGCCTCGG CGATGTCGCG GGCCGCGTCA GCCGGGCCCT TGCGGATTTC
ACCCATCCGG GCCTGGACCG GATCCTGCAG TGGGACTTGC GGTTCGGGAT GCATGTGGTC
GACGAGCTGA GTGCGCACGT CGGTGAGCCC GCGCTGCGGC GCCGGCTGCA GACCGCCGCG
CGCGACGCGT GGGCGCGGAT CGCACCACTC GACGATGCGC TGCCGCGACA GGCGGCTCAC
ATCGACCTGA CCGACGCGAA CGTGGTGGTC TCCCCGGCGG ACGGCCGCCC CGACGGGGTC
ATCGACTTCG GCGACCTCTC GCACACCTGG GCGGTCTCCG AGCTGGCGAT CACCGCCTCG
TCGGTACTGG GGCACGTCGG TGCACAAGTC ACTTCGGTGT TGCCGGCGAT CCGCGCGTTC
CACGCTGTGC GTCCGCTGTC GGTGGCAGAA GCCGACGCGC TCTGGCCGAT GCTGGTGTTG
CGAACCGCGG TGCTGATCGT CAGCGGAGCG CAGCAGTCCG TCCTCGACCC CGACAATGAG
TACCTCACCG AACAATCCGA CGCCGAGCAG CAGATGTTCG ACCTCGCCAC CTCGGTCCCG
ATCGATGTGA TGACCGCGGT GATCAAGGCC GGCCTGGGGA TGGCGCAGCC GTCGCCGCCG
GTTCGGGTGC AGGCGCAACT GATCGGCGCG GACAGGACCT CTACGGTCAC CCTCGATCTG
TCCACCACAT CCGAGGTCTA TGACGACGCG TTCGACGCCG CCGGGGTGAT GCGCTCCGAT
ATCGAGGACG AATCCGCAAG GGCCGCAATG CATCAGGGCG CCACGGTGGT GGTCACCCGC
TTCGGAGAGG CCAGGCTGGA CCGGGCGCCG AGGTTGAGCC AGGACAGCCC TGAGGTGGTG
GCCACCGGGA TCAGCATGTG GACAGCCGCC GACACCGACA TCGCAGCGCC GTGGGACGGC
GAGGTGGTCA CCGATGCCAC GGGATCGATC ACGCTGCGCG GCAATGACTT CGAGGTGACC
GTGGTCGGCG CGGCGCCGGC CGGCGGCGCC GCGGTGTGCG CCGGCGAGGT CCTGGCCAGT
GCCCGGGCCG GTGAGCGGAT CGAGGTGAGC GTTCGCCCTG TCGGTGTGCC GGTCGCCCCA
CCGTTCATCC GCGCCGATCT GGCACCCGGG TGGCTCGCAC AGGTCCGCGA CCCCAGGCCA
CTGCTCGGGC TCGCTCCGCT CGAGCAGGAC GGCGCCGCCG ACCTGCTTTC CCGGCGGGAC
GCGAGTTTCG CTCCGGTCCA GGAGTTCTAC TACCGCACGC CACCGCAGAT CGAACGCGGC
CGACGGCATT ACCTGATGTC GACCGCGGGC CGCAGTTACC TCGACATGGT CAACAACGTC
ACCGTGCTCG GGCACGCACA CCCGCGAATC GCCGACACCG CCGCCCGCCA GTTGCGCAGG
CTCAACACCA ATTCCCGGTT CAACTACGAG GCCGTCGTCG AATTCAGCGA GCGGCTGGCG
GCGCTGCTGC CCGACCCACT GGACACGGTC TTCCTGGTCA ACTCCGGCTC GGAGGCCAGC
GACCTGGCGA TCAGGTTGGC CACCGCGGCC ACCGGCCGAC GCGACGTGGT CGCGGTCCGT
GAGGCCTATC ACGGGTGGAC GTACGGCACC GACGCGGTGT CGACGTCGAC CGCCGACAAC
CCCAATGCGC TTGCCACCCG CCCGGATTGG GTGCACACCG TCGAGTCACC CAACAGCTTC
CGCGGCAAGT ACCGCGGCTC GGAGGCGTTC CGCTACGCCG AGGACGCGGT CGCCCAGATC
GAGGCGCTGG TCATGTCGGG GCGACCGCCG GCGGCGTTCA TCTGCGAAAG CGTGTACGGC
AACGCCGGCG GCATGGCGCT GCCGGACGGC TACCTGAAGC AGGTTTACGC GGCGGTGCGG
GCCGGCGGCG GGCTGGCGAT CTCCGATGAA GTCCAGGTCG GCTACGGCCG GCTCGGTGAG
TGGTTCTGGG GATTCCAGCA GCAGGATGCG GTGCCCGACA TCGTGTCGGT GGCGAAGTCC
GTGGGCAACG GTTACCCGGT GGGAGCGGTG ATCACCACCC GCGCCGTGGC CGAGGCGTTC
TCCAGCCAGG GTTACTTCTT CTCCTCCACC GGCGGAAGCC CGCTGTCCTG TGCGATCGGG
ATGACGGTGC TCGACGTGCT GCGCGACGAG GGACTGCAGG ACAACGCCCG CCGCGTCGGC
ACTCACCTCA AGACCAGGCT GGAAGGGCTG AAGGAACGTC ACCCGCTCGT CGGCACCGTG
CACGGGTTCG GGCTGTACCT GGGGGTCGAG ATGATCCGCG ACCCGCAGAC CTTGACCCCG
GCAACCGCGG AGACCTCGGC GATCTGCGAC CGGATGCTCG ACCTCGGCGT GATCATCCAG
CCCACCGGCG ACCACCAGAA CATCCTCAAG ACCAAACCGC CGCTGTGTAT CGACGTCGAA
GCAGCCGACT TCTACGTCGA CACCCTTGAC CGGGTCTTGA CCGAAGGTTG GTAA
 
Protein sequence
MSTRSTAGFN FLEQQELPAP QVSEAQAQDI LATHYGLAAH VSALGSQQDK NFTVRDDGGA 
VVGVLKIANP AFTAEELAAQ DAAARRIAEA EPGLRVAVPL ANAAGETRTA VDGVLEGTAL
VRLLQFLPGG TVSESGYLTP DSVAGLGDVA GRVSRALADF THPGLDRILQ WDLRFGMHVV
DELSAHVGEP ALRRRLQTAA RDAWARIAPL DDALPRQAAH IDLTDANVVV SPADGRPDGV
IDFGDLSHTW AVSELAITAS SVLGHVGAQV TSVLPAIRAF HAVRPLSVAE ADALWPMLVL
RTAVLIVSGA QQSVLDPDNE YLTEQSDAEQ QMFDLATSVP IDVMTAVIKA GLGMAQPSPP
VRVQAQLIGA DRTSTVTLDL STTSEVYDDA FDAAGVMRSD IEDESARAAM HQGATVVVTR
FGEARLDRAP RLSQDSPEVV ATGISMWTAA DTDIAAPWDG EVVTDATGSI TLRGNDFEVT
VVGAAPAGGA AVCAGEVLAS ARAGERIEVS VRPVGVPVAP PFIRADLAPG WLAQVRDPRP
LLGLAPLEQD GAADLLSRRD ASFAPVQEFY YRTPPQIERG RRHYLMSTAG RSYLDMVNNV
TVLGHAHPRI ADTAARQLRR LNTNSRFNYE AVVEFSERLA ALLPDPLDTV FLVNSGSEAS
DLAIRLATAA TGRRDVVAVR EAYHGWTYGT DAVSTSTADN PNALATRPDW VHTVESPNSF
RGKYRGSEAF RYAEDAVAQI EALVMSGRPP AAFICESVYG NAGGMALPDG YLKQVYAAVR
AGGGLAISDE VQVGYGRLGE WFWGFQQQDA VPDIVSVAKS VGNGYPVGAV ITTRAVAEAF
SSQGYFFSST GGSPLSCAIG MTVLDVLRDE GLQDNARRVG THLKTRLEGL KERHPLVGTV
HGFGLYLGVE MIRDPQTLTP ATAETSAICD RMLDLGVIIQ PTGDHQNILK TKPPLCIDVE
AADFYVDTLD RVLTEGW