Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mvan_4610 |
Symbol | |
ID | 4646627 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium vanbaalenii PYR-1 |
Kingdom | Bacteria |
Replicon accession | NC_008726 |
Strand | + |
Start bp | 4950420 |
End bp | 4953353 |
Gene Length | 2934 bp |
Protein Length | 977 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 639808080 |
Product | hypothetical protein |
Protein accession | YP_955391 |
Protein GI | 120405562 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG0160] 4-aminobutyrate aminotransferase and related aminotransferases [COG2334] Putative homoserine kinase type II (protein kinase fold) |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.102283 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGCACCC GATCCACCGC GGGCTTCAAC TTCCTGGAAC AGCAGGAGCT GCCCGCCCCG CAGGTGAGCG AGGCCCAGGC GCAGGACATC CTGGCCACAC ACTACGGTCT GGCCGCGCAT GTGAGCGCGC TGGGAAGCCA GCAGGACAAG AACTTCACGG TCCGCGACGA CGGCGGCGCG GTGGTCGGGG TGCTCAAGAT CGCCAACCCG GCGTTCACCG CCGAGGAATT GGCCGCCCAG GATGCGGCGG CCCGGCGGAT CGCCGAGGCC GAACCCGGTC TGCGGGTTGC GGTGCCGCTG GCCAACGCCG CAGGCGAGAC ACGCACCGCG GTCGACGGTG TGCTCGAGGG CACCGCCCTC GTGCGACTGC TGCAATTCCT GCCGGGCGGC ACCGTGTCGG AGTCCGGCTA CCTGACCCCG GACTCGGTGG CCGGCCTCGG CGATGTCGCG GGCCGCGTCA GCCGGGCCCT TGCGGATTTC ACCCATCCGG GCCTGGACCG GATCCTGCAG TGGGACTTGC GGTTCGGGAT GCATGTGGTC GACGAGCTGA GTGCGCACGT CGGTGAGCCC GCGCTGCGGC GCCGGCTGCA GACCGCCGCG CGCGACGCGT GGGCGCGGAT CGCACCACTC GACGATGCGC TGCCGCGACA GGCGGCTCAC ATCGACCTGA CCGACGCGAA CGTGGTGGTC TCCCCGGCGG ACGGCCGCCC CGACGGGGTC ATCGACTTCG GCGACCTCTC GCACACCTGG GCGGTCTCCG AGCTGGCGAT CACCGCCTCG TCGGTACTGG GGCACGTCGG TGCACAAGTC ACTTCGGTGT TGCCGGCGAT CCGCGCGTTC CACGCTGTGC GTCCGCTGTC GGTGGCAGAA GCCGACGCGC TCTGGCCGAT GCTGGTGTTG CGAACCGCGG TGCTGATCGT CAGCGGAGCG CAGCAGTCCG TCCTCGACCC CGACAATGAG TACCTCACCG AACAATCCGA CGCCGAGCAG CAGATGTTCG ACCTCGCCAC CTCGGTCCCG ATCGATGTGA TGACCGCGGT GATCAAGGCC GGCCTGGGGA TGGCGCAGCC GTCGCCGCCG GTTCGGGTGC AGGCGCAACT GATCGGCGCG GACAGGACCT CTACGGTCAC CCTCGATCTG TCCACCACAT CCGAGGTCTA TGACGACGCG TTCGACGCCG CCGGGGTGAT GCGCTCCGAT ATCGAGGACG AATCCGCAAG GGCCGCAATG CATCAGGGCG CCACGGTGGT GGTCACCCGC TTCGGAGAGG CCAGGCTGGA CCGGGCGCCG AGGTTGAGCC AGGACAGCCC TGAGGTGGTG GCCACCGGGA TCAGCATGTG GACAGCCGCC GACACCGACA TCGCAGCGCC GTGGGACGGC GAGGTGGTCA CCGATGCCAC GGGATCGATC ACGCTGCGCG GCAATGACTT CGAGGTGACC GTGGTCGGCG CGGCGCCGGC CGGCGGCGCC GCGGTGTGCG CCGGCGAGGT CCTGGCCAGT GCCCGGGCCG GTGAGCGGAT CGAGGTGAGC GTTCGCCCTG TCGGTGTGCC GGTCGCCCCA CCGTTCATCC GCGCCGATCT GGCACCCGGG TGGCTCGCAC AGGTCCGCGA CCCCAGGCCA CTGCTCGGGC TCGCTCCGCT CGAGCAGGAC GGCGCCGCCG ACCTGCTTTC CCGGCGGGAC GCGAGTTTCG CTCCGGTCCA GGAGTTCTAC TACCGCACGC CACCGCAGAT CGAACGCGGC CGACGGCATT ACCTGATGTC GACCGCGGGC CGCAGTTACC TCGACATGGT CAACAACGTC ACCGTGCTCG GGCACGCACA CCCGCGAATC GCCGACACCG CCGCCCGCCA GTTGCGCAGG CTCAACACCA ATTCCCGGTT CAACTACGAG GCCGTCGTCG AATTCAGCGA GCGGCTGGCG GCGCTGCTGC CCGACCCACT GGACACGGTC TTCCTGGTCA ACTCCGGCTC GGAGGCCAGC GACCTGGCGA TCAGGTTGGC CACCGCGGCC ACCGGCCGAC GCGACGTGGT CGCGGTCCGT GAGGCCTATC ACGGGTGGAC GTACGGCACC GACGCGGTGT CGACGTCGAC CGCCGACAAC CCCAATGCGC TTGCCACCCG CCCGGATTGG GTGCACACCG TCGAGTCACC CAACAGCTTC CGCGGCAAGT ACCGCGGCTC GGAGGCGTTC CGCTACGCCG AGGACGCGGT CGCCCAGATC GAGGCGCTGG TCATGTCGGG GCGACCGCCG GCGGCGTTCA TCTGCGAAAG CGTGTACGGC AACGCCGGCG GCATGGCGCT GCCGGACGGC TACCTGAAGC AGGTTTACGC GGCGGTGCGG GCCGGCGGCG GGCTGGCGAT CTCCGATGAA GTCCAGGTCG GCTACGGCCG GCTCGGTGAG TGGTTCTGGG GATTCCAGCA GCAGGATGCG GTGCCCGACA TCGTGTCGGT GGCGAAGTCC GTGGGCAACG GTTACCCGGT GGGAGCGGTG ATCACCACCC GCGCCGTGGC CGAGGCGTTC TCCAGCCAGG GTTACTTCTT CTCCTCCACC GGCGGAAGCC CGCTGTCCTG TGCGATCGGG ATGACGGTGC TCGACGTGCT GCGCGACGAG GGACTGCAGG ACAACGCCCG CCGCGTCGGC ACTCACCTCA AGACCAGGCT GGAAGGGCTG AAGGAACGTC ACCCGCTCGT CGGCACCGTG CACGGGTTCG GGCTGTACCT GGGGGTCGAG ATGATCCGCG ACCCGCAGAC CTTGACCCCG GCAACCGCGG AGACCTCGGC GATCTGCGAC CGGATGCTCG ACCTCGGCGT GATCATCCAG CCCACCGGCG ACCACCAGAA CATCCTCAAG ACCAAACCGC CGCTGTGTAT CGACGTCGAA GCAGCCGACT TCTACGTCGA CACCCTTGAC CGGGTCTTGA CCGAAGGTTG GTAA
|
Protein sequence | MSTRSTAGFN FLEQQELPAP QVSEAQAQDI LATHYGLAAH VSALGSQQDK NFTVRDDGGA VVGVLKIANP AFTAEELAAQ DAAARRIAEA EPGLRVAVPL ANAAGETRTA VDGVLEGTAL VRLLQFLPGG TVSESGYLTP DSVAGLGDVA GRVSRALADF THPGLDRILQ WDLRFGMHVV DELSAHVGEP ALRRRLQTAA RDAWARIAPL DDALPRQAAH IDLTDANVVV SPADGRPDGV IDFGDLSHTW AVSELAITAS SVLGHVGAQV TSVLPAIRAF HAVRPLSVAE ADALWPMLVL RTAVLIVSGA QQSVLDPDNE YLTEQSDAEQ QMFDLATSVP IDVMTAVIKA GLGMAQPSPP VRVQAQLIGA DRTSTVTLDL STTSEVYDDA FDAAGVMRSD IEDESARAAM HQGATVVVTR FGEARLDRAP RLSQDSPEVV ATGISMWTAA DTDIAAPWDG EVVTDATGSI TLRGNDFEVT VVGAAPAGGA AVCAGEVLAS ARAGERIEVS VRPVGVPVAP PFIRADLAPG WLAQVRDPRP LLGLAPLEQD GAADLLSRRD ASFAPVQEFY YRTPPQIERG RRHYLMSTAG RSYLDMVNNV TVLGHAHPRI ADTAARQLRR LNTNSRFNYE AVVEFSERLA ALLPDPLDTV FLVNSGSEAS DLAIRLATAA TGRRDVVAVR EAYHGWTYGT DAVSTSTADN PNALATRPDW VHTVESPNSF RGKYRGSEAF RYAEDAVAQI EALVMSGRPP AAFICESVYG NAGGMALPDG YLKQVYAAVR AGGGLAISDE VQVGYGRLGE WFWGFQQQDA VPDIVSVAKS VGNGYPVGAV ITTRAVAEAF SSQGYFFSST GGSPLSCAIG MTVLDVLRDE GLQDNARRVG THLKTRLEGL KERHPLVGTV HGFGLYLGVE MIRDPQTLTP ATAETSAICD RMLDLGVIIQ PTGDHQNILK TKPPLCIDVE AADFYVDTLD RVLTEGW
|
| |