Gene Information |
Locus tag | Msed_0275 |
Symbol | |
ID | 5103895 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 232982 |
End bp | 234235 |
Gene Length | 1254 bp |
Protein Length | 417 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 640506181 |
Product | 4-aminobutyrate aminotransferase |
Protein accession | YP_001190376 |
Protein GI | 146303060 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0160] 4-aminobutyrate aminotransferase and related aminotransferases |
TIGRFAM ID | [TIGR00700] 4-aminobutyrate aminotransferase, prokaryotic type |
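The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the function names `gene_length` and `protein_length` are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table.
start_bp = 232982
end_bp = 234235
reported_gene_length = 1254      # bp
reported_protein_length = 417    # aa


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


computed_gene_length = gene_length(start_bp, end_bp)
computed_protein_length = protein_length(computed_gene_length)

print(computed_gene_length == reported_gene_length)
print(computed_protein_length == reported_protein_length)
```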
Plasmid Coverage Information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage Information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0862859 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAGCCTAG CTTCAAGGAT AATAGAAGAG GATTCAAGGT ATCTAATGCA GTCCTTCAGC AGATGGTATC CCCTGGTGAT TGAACGTGCC AAGGGATCCC TAGTGTATGA CGTGGATGGA AAGGAGTACA TTGACATGAA TGCAGGCATA GGGGTAATGG CCCTGGGACA CGGCAACGAG AAGATCATCC GCGCCGTTCA AGAGCAGATG AACAAGTTTT TCCACTATAG CCTAACCGAC TTCTATTACG ACCTAGCTGT AAGGGTAGCC CGCAAGCTGG TATCACTCGT GGGATTTCAG GGTAAGGTGT TTTACGCCAA CAGTGGCACG GAGAGTGTGG AAGCCAGCCT GAAGATAGCC AGGGGTCACA CTGGGAGGCA GTACATCATA GGTTTCACTA ACTCATTTCA CGGAAGAACA TTCGGATCCA TGTCCTTCAC CTCAAGTAAA TCGGTGCAAA GATCTGCGTT CTCACCACTT TTACCCTCCA CTCTCCTGGT TCCATACCCT GACAGACATA ACCCGCTTTG CCGAGAGGAT TGCGCCAACG CAGTTCTGGA ATACATTGAG GATTGGGTCC TGAAGAAGAT CGTGGACCCC AACGACGTTG CGGGTTTCCT CCTGGAACCT ATTCAGGGGG AGGGAGGAAT CATCGTCCCC CCAAGGGAGT TCCTTCAAGG TCTCCAGAGG ATAGCGAGGA AGAACGGGAT TCTCCTCATC CTAGATGAGG TCCAGACTGG AATAGGCAGG ACAGGAAAGA TGTTTGCCTT TGAGCATTTT GGTGTGGAGC CGGATCTGAT ATGTCTGGCC AAGGCCCTTG GGGGAGGCTT ACCCTTGGGG GCAGTGGTGG GGAGGAGTGA GGTCATGGAC CTCCCAAGAG GTTCCCATGC CAACACGTTC GGGGGAAATG CCCTAGCCCT GGCCGCGGCC GAGGTCGTGC TAGAGGAGGT TCCAGGGCTT CTAGGAAGGG TTAACTCCCT GGGTAAAATG ATCGTGGACA TTCTAGGCTC CACGAAGTCC AGATACGTGG AGGAGATTAG GGGTATGGGT CTCATGATAG GAGTAGACCT AAGGAGAGAT GGGGAACCCT ATGAGGAGGG GCTCGAGAAG GTTCTGAGGA GATCCTTCGA AAGAGGAGTA CTCGCCATAG GAGCAGGTGA ATCAGTAGTG AGATTACTTC CTCCCCTCGT GATAGAGGAA GAACTTGCTC AGAGGGGTAG CTCTATAATA AGGGAGGAAA TAGATAGGTT ATAG
Protein sequence | MSLASRIIEE DSRYLMQSFS RWYPLVIERA KGSLVYDVDG KEYIDMNAGI GVMALGHGNE KIIRAVQEQM NKFFHYSLTD FYYDLAVRVA RKLVSLVGFQ GKVFYANSGT ESVEASLKIA RGHTGRQYII GFTNSFHGRT FGSMSFTSSK SVQRSAFSPL LPSTLLVPYP DRHNPLCRED CANAVLEYIE DWVLKKIVDP NDVAGFLLEP IQGEGGIIVP PREFLQGLQR IARKNGILLI LDEVQTGIGR TGKMFAFEHF GVEPDLICLA KALGGGLPLG AVVGRSEVMD LPRGSHANTF GGNALALAAA EVVLEEVPGL LGRVNSLGKM IVDILGSTKS RYVEEIRGMG LMIGVDLRRD GEPYEEGLEK VLRRSFERGV LAIGAGESVV RLLPPLVIEE ELAQRGSSII REEIDRL
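The sequences above are printed in space-separated blocks of ten characters. A minimal sketch for cleaning such a string and computing its GC percentage follows; `clean_sequence` and `gc_percent` are hypothetical helper names, and the short input is a toy example, not the gene itself.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Toy input in the same blocked layout as the Gene sequence row.
toy = clean_sequence("ATGC GCTA")
toy_gc = gc_percent(toy)
print(toy, toy_gc)
```

Applied to the Gene sequence row, the cleaned length would be expected to match the reported Gene Length of 1254 bp, and the GC percentage to fall near the reported 52%.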