Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | VC0395_A1232 |
Symbol | |
ID | 5135738 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Vibrio cholerae O395 |
Kingdom | Bacteria |
Replicon accession | NC_009457 |
Strand | - |
Start bp | 1298958 |
End bp | 1301843 |
Gene Length | 2886 bp |
Protein Length | 961 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 640532690 |
Product | aminotransferase, class III/decarboxylase, group II |
Protein accession | YP_001217176 |
Protein GI | 147674813 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0076] Glutamate decarboxylase and related PLP-dependent proteins [COG0160] 4-aminobutyrate aminotransferase and related aminotransferases |
TIGRFAM ID | [TIGR00709] 2,4-diaminobutyrate 4-transaminases |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.000000122861 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTACAG CCTTTGAAGT CGATAACCAC ATCGCAACTC TGTTTTCAAC TCAAGTGCCG TTGCTAGACG GTCTTTACGA TCTCACTCCA GACCAAGTGC TGCTCGATCA AGCTGCGCAT GAATCTGAAG TGCGTTCTTA TCCACGCCGT ATTCCTATTG CCATTAAGCA AGCCTATGGC TGTTTGGTGG AAGATACGCG TGGGCAGATT TTCTTGGATT GCTTAGCCGG TGCAGGTACT TTGGTGCTCG GTTATAACCA TCCAGAAATC AATCAAGCAC TGAAAGCACA ACTCGACTCT GGTCTGCCAT ACCAAACTCT GGATATTGCA ACCGAAGCGA AAACCCATTT CATCAAGACC GTGAAAGGCT TTTTGCCTAA GGCCTTGGGT GAAGATTGTG TGATCCAGTT CTGTGGCCCA TCGGGTGCCG ATGCGGTTGA GGCTGCAATT AAGCTTGCCA AGCAAACCAC GGGGCGTAAC ACCATGTTTG CCTTCCGTGG TGCCTGCCAC GGTATGACTA ACGGCACGAT GGGGATGATG GGTAACCTCA ATACCAAAGC GCGCCGTACG GGCTTGATGT CTGATGTGCA CTTTATGCCT TTCCCGTACA GCTTACGCTG TCCGTTTGGT TTAGGGGGTG ATGAGGGTGC GAAAGCCAGT ATCCGCTACA TCGAACGTCT GCTGAATGAT GATGAAGCGG GGATCATGAA ACCTGCTGCG ATCATCGTTG AACCCGTACA AGGTGAGGGA GGTGTTATCC CTGCGCCAGC ATTCTGGTTG CGCGAACTGC GCCGCATTTG TGATGAGCAC GGCATCCTGT TGATCTTTGA CGAAATCCAA TGTGGTGTAG GCAAAACAGG CCACAACTTT GCATTTGAAG AGTCCGGCAT TATTCCTGAT GTACTTTGCT TATCAAAAGC GATCGGCGGC GGTTTACCCA TGTCGATTCT GGTTATCAAC AAAAAACACG ACACATGGCG TCCGGGTGAG CATACCGGAA CCTTCCGTGG CAACCAGCTT GCAATGGTAT CTGGCGCTAA AGCGCTTGAA ATCATTCAGC GTGATAACCT CGTTGAGCAT GCGCGTATTG CGGGTCAATA CCTACGCGCT GGCCTAGAAA AAATCCAAAG CCGAGTCAAC TGTGTTGCCG AAGTACGTGG AAAAGGCTTG ATGCTCGGTC TTGAGATTAA AGATCCAAGC GGCGAACTGA ACAAGTTTGG TGAGCCAAAA TCAGCACCAC AACTGACACT GGCAATTCAA CGTGCGGCAC TTGAGCGCGG CTTGATGGTA GAAAAAGGCG GCCGTGATGG TTCCGTGATT CGTTTCTTGC CACCAGTCAT CATCTCGTTT GAACAGCTCG ATTTCGCGTT ACGCGTGCTT GAAGAGGCGA TTCTTGCTGC AGGCGGCGGC AAAAAAGATC CTGAGCAAGT TAATCAAGAG TGGAAAAAAC ACTTTATTCA CACTGGCCAC ATGGGCAGCC AAGAGTTCTC ACAAGTGATG AACCATACGA CCGCTGCGAT GAAAGCCGTG TTTGAAGAGG TGAAAGCACC CTATTCAGGC CTTGATCCTA AAGTGCTGGA AGAAGCGATT TACGCGGTGG ATCTGGATAA CAAAAATGCG TCACTAAAAG AAGTTATCTC TGAAACCGCA GAGTTGATCG CGAAGAACTC CATCATGGTT CAACACCCTG ACTGTATTGC GCATTTGCAC ACGCCACCCT TAATGCCTTC CGTTGTTGCT GAAGCCATTA TTGCGTCATT GAACCAATCC ATGGACTCAT GGGATCAATC TTCTGCAGCA ACCTTTGTTG AGCAGAAAGT GGTGGATTGG ATGTGTGAAA AGTATGAACT GGGCGCGCAA GCGGACGGTA TTTTCACCAG CGGTGGCACG CAAAGCAACC AGATGGGCTT AATGCTGGCT CGTGACTGGA TCGCCGATAA GCTCAGTGGC CATTCAATCC AGAAATTGGG TCTACCTGAG TATGCGGACA AACTGCGTAT TGTGTGCTCG AAGAAATCTC ACTTCACGGT ACAAAAATCG GCATCTTGGA TGGGCCTCGG TGAGAAAGCC ATTCTTGCTG TTGACGCTCT ACCTAATGGC ACGATGGATG TGACTAAGCT TGAAGCTGCG GTTGAACAAG CCAAAGCTGA AGGGCTTATT CCCTTTGCCA TTGTGGGTAC CGCAGGCACT ACCGACCATG GTGCGATTGA TGATCTGGTG ACGATCGCCG ATGTGGCTGA GAAGCACGCA CTGTGGATGC ACGTGGACAG TGCCTATGGT GGCGCACTGA TCCTGAGCAG TCATAAAGAT CGCCTCAATG GCATCGAGCG CGCTCAATCC ATCAGTGTGG ATTTCCATAA GCTGTTTTTC CAAACCATTA GCTGTGGCGC ATTGCTGCTC AAAGACAAGC ATAACTTTAA GTATCTGCTG CACCATGCAG ATTACCTAAA CCGTGAGCAC GATACGCTGC CAAACCTAGT CGATAAATCG ATTTCGACTA CTAAACGTTT TGATGCCCTA AAAGTGTTTA TGACCATGCA AAACGTGGGA CCTAAACAAC TGGGCGCCAT GTACGATCAC CTGCTCGCTC AAACATTGCA GGTGGCGGAA CTGGTACGTC AGCATCAAAG CTTTGAGCTT TTAGCTGAGC CATCTTTGTC TACCGTGTTA TTCCGCGCGG TAAATGAACA AGCTGCTGAT CTGGATGAAT TGAACAAAGC CGTTCGTCTA CAGGCGCTGG TACGTGGTGT CGCTGTGCTT GGCGAAACCA TAGTGGATGG CAAAACGGCC TTAAAATTCA CAATCTTGAA CCCATGCTTG ACGATGTCGG ATTTCGACTC TCTACTGGCT AAAATTGAAG CTCTAGCTGC TGAGCTAGCG AACTAA
|
Protein sequence | MSTAFEVDNH IATLFSTQVP LLDGLYDLTP DQVLLDQAAH ESEVRSYPRR IPIAIKQAYG CLVEDTRGQI FLDCLAGAGT LVLGYNHPEI NQALKAQLDS GLPYQTLDIA TEAKTHFIKT VKGFLPKALG EDCVIQFCGP SGADAVEAAI KLAKQTTGRN TMFAFRGACH GMTNGTMGMM GNLNTKARRT GLMSDVHFMP FPYSLRCPFG LGGDEGAKAS IRYIERLLND DEAGIMKPAA IIVEPVQGEG GVIPAPAFWL RELRRICDEH GILLIFDEIQ CGVGKTGHNF AFEESGIIPD VLCLSKAIGG GLPMSILVIN KKHDTWRPGE HTGTFRGNQL AMVSGAKALE IIQRDNLVEH ARIAGQYLRA GLEKIQSRVN CVAEVRGKGL MLGLEIKDPS GELNKFGEPK SAPQLTLAIQ RAALERGLMV EKGGRDGSVI RFLPPVIISF EQLDFALRVL EEAILAAGGG KKDPEQVNQE WKKHFIHTGH MGSQEFSQVM NHTTAAMKAV FEEVKAPYSG LDPKVLEEAI YAVDLDNKNA SLKEVISETA ELIAKNSIMV QHPDCIAHLH TPPLMPSVVA EAIIASLNQS MDSWDQSSAA TFVEQKVVDW MCEKYELGAQ ADGIFTSGGT QSNQMGLMLA RDWIADKLSG HSIQKLGLPE YADKLRIVCS KKSHFTVQKS ASWMGLGEKA ILAVDALPNG TMDVTKLEAA VEQAKAEGLI PFAIVGTAGT TDHGAIDDLV TIADVAEKHA LWMHVDSAYG GALILSSHKD RLNGIERAQS ISVDFHKLFF QTISCGALLL KDKHNFKYLL HHADYLNREH DTLPNLVDKS ISTTKRFDAL KVFMTMQNVG PKQLGAMYDH LLAQTLQVAE LVRQHQSFEL LAEPSLSTVL FRAVNEQAAD LDELNKAVRL QALVRGVAVL GETIVDGKTA LKFTILNPCL TMSDFDSLLA KIEALAAELA N
|
| |