Gene Msed_0275 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_0275 
Symbol 
ID5103895 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp232982 
End bp234235 
Gene Length1254 bp 
Protein Length417 aa 
Translation table11 
GC content52% 
IMG OID640506181 
Product4-aminobutyrate aminotransferase 
Protein accessionYP_001190376 
Protein GI146303060 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0160] 4-aminobutyrate aminotransferase and related aminotransferases 
TIGRFAM ID[TIGR00700] 4-aminobutyrate aminotransferase, prokaryotic type 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0862859 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCTAG CTTCAAGGAT AATAGAAGAG GATTCAAGGT ATCTAATGCA GTCCTTCAGC 
AGATGGTATC CCCTGGTGAT TGAACGTGCC AAGGGATCCC TAGTGTATGA CGTGGATGGA
AAGGAGTACA TTGACATGAA TGCAGGCATA GGGGTAATGG CCCTGGGACA CGGCAACGAG
AAGATCATCC GCGCCGTTCA AGAGCAGATG AACAAGTTTT TCCACTATAG CCTAACCGAC
TTCTATTACG ACCTAGCTGT AAGGGTAGCC CGCAAGCTGG TATCACTCGT GGGATTTCAG
GGTAAGGTGT TTTACGCCAA CAGTGGCACG GAGAGTGTGG AAGCCAGCCT GAAGATAGCC
AGGGGTCACA CTGGGAGGCA GTACATCATA GGTTTCACTA ACTCATTTCA CGGAAGAACA
TTCGGATCCA TGTCCTTCAC CTCAAGTAAA TCGGTGCAAA GATCTGCGTT CTCACCACTT
TTACCCTCCA CTCTCCTGGT TCCATACCCT GACAGACATA ACCCGCTTTG CCGAGAGGAT
TGCGCCAACG CAGTTCTGGA ATACATTGAG GATTGGGTCC TGAAGAAGAT CGTGGACCCC
AACGACGTTG CGGGTTTCCT CCTGGAACCT ATTCAGGGGG AGGGAGGAAT CATCGTCCCC
CCAAGGGAGT TCCTTCAAGG TCTCCAGAGG ATAGCGAGGA AGAACGGGAT TCTCCTCATC
CTAGATGAGG TCCAGACTGG AATAGGCAGG ACAGGAAAGA TGTTTGCCTT TGAGCATTTT
GGTGTGGAGC CGGATCTGAT ATGTCTGGCC AAGGCCCTTG GGGGAGGCTT ACCCTTGGGG
GCAGTGGTGG GGAGGAGTGA GGTCATGGAC CTCCCAAGAG GTTCCCATGC CAACACGTTC
GGGGGAAATG CCCTAGCCCT GGCCGCGGCC GAGGTCGTGC TAGAGGAGGT TCCAGGGCTT
CTAGGAAGGG TTAACTCCCT GGGTAAAATG ATCGTGGACA TTCTAGGCTC CACGAAGTCC
AGATACGTGG AGGAGATTAG GGGTATGGGT CTCATGATAG GAGTAGACCT AAGGAGAGAT
GGGGAACCCT ATGAGGAGGG GCTCGAGAAG GTTCTGAGGA GATCCTTCGA AAGAGGAGTA
CTCGCCATAG GAGCAGGTGA ATCAGTAGTG AGATTACTTC CTCCCCTCGT GATAGAGGAA
GAACTTGCTC AGAGGGGTAG CTCTATAATA AGGGAGGAAA TAGATAGGTT ATAG
 
Protein sequence
MSLASRIIEE DSRYLMQSFS RWYPLVIERA KGSLVYDVDG KEYIDMNAGI GVMALGHGNE 
KIIRAVQEQM NKFFHYSLTD FYYDLAVRVA RKLVSLVGFQ GKVFYANSGT ESVEASLKIA
RGHTGRQYII GFTNSFHGRT FGSMSFTSSK SVQRSAFSPL LPSTLLVPYP DRHNPLCRED
CANAVLEYIE DWVLKKIVDP NDVAGFLLEP IQGEGGIIVP PREFLQGLQR IARKNGILLI
LDEVQTGIGR TGKMFAFEHF GVEPDLICLA KALGGGLPLG AVVGRSEVMD LPRGSHANTF
GGNALALAAA EVVLEEVPGL LGRVNSLGKM IVDILGSTKS RYVEEIRGMG LMIGVDLRRD
GEPYEEGLEK VLRRSFERGV LAIGAGESVV RLLPPLVIEE ELAQRGSSII REEIDRL