Gene Smed_5089 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_5089 
Symbol 
ID5319391 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009621 
Strand
Start bp37849 
End bp39204 
Gene Length1356 bp 
Protein Length451 aa 
Translation table11 
GC content63% 
IMG OID640776868 
Productaminotransferase class-III 
Protein accessionYP_001313800 
Protein GI150377205 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0160] 4-aminobutyrate aminotransferase and related aminotransferases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.447769 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGAACA ATCAGGATCC CGCCTTCTGG GCGGCCGCGG GCAGGCATCT GATCCGCTAT 
GGCGGAAGCT TCGACCCCGC GATCATCGAG CGGGCCAGGG GCTCTTTCGT TTTTGATGCC
GACGACCGGC CGATACTCGA CTTCACCTCG GGCCAGATGA GTGCACTGGT CGGCCATTCC
CATCCCCGGA TCGTCGCGAC CGTTCAGCGG CAGATGGAAA AGGTCGCCCA TCTTTTCAGC
GGAATGCTGT CGCGACCGGT CGTCGATCTC GCCGAGCGGC TGGCGGCGCT GGCACCCGGT
CTCGACAGGG TCATGCTGTT GTCGACCGGG GCGGAATCCA ACGAGGCGGC CATCCGCATG
GCCAAGCTCG TCACCGGCAG GCACGAAATC GTCGCATTCT CAAAGAGCTG GCACGGCATG
ACCGGCGCGG CGAGTTCGGC AACATACAGC GCCGGCCGCA AGGGTTATGG CCCGGCCATG
GTCGGTTCTC TGACCATTCC GGCACCCAAC ACATTCCGCC CGCGTTTTCG GCATGGCGAC
GGGAGCCTGG ACTGGAGGAC GGAGCTGGAC GATGCTTTCG CGCTGATCGA CAGTCAGTCG
ACCGGCAGCC TCGCCGCCTT CATCGCCGAG CCCATCCTGT CGAGCGGCGG ATTGCTCGAA
CTGCCGCAGG GCTATCTCGC AGCGCTCATG GAAAAATGCC GCGAGCGCGG AATGCTGCTC
ATTCTGGACG AGGCGCAAAC CGGAATCGGC CGGACTGGAA CCATGTTCGC GTTCCAGCGC
GACGGCGTTA CGCCCGATAT TCTGACGCTC TCGAAAACGA TCGGCGCCGG CCTGCCACTC
TCCGCCGTCA TGACAACGAC GGAGATCGAG GAGGCGGCGC ATGAGAAGGG CTTCCTTTTC
TACACCACGC ATGTCTCCGA TCCCCTGCCG GCCGCGGTGG GCCTTGCCGT GCTCGACGTC
GTCGCCGAGG AAGGGCTTGT CGAGCGCGCC CGTCATATCG GCGGCGAGCT CTTCGATGGC
CTGTCGCAGT TGAAGCAGAG ATTCGACTGC GTCGGCGACG TACGCGGTCG CGGCCTTATG
CTCGGCGTCG AAATCGTGAA ACCGGGTGAG AGCAGAAGTG CCGATCATGA GCTTGGCAGC
CGGATTGCCG CCGAAGCTTT CCGCCGTGGG CTCAGTATGA ATATCGTTAA GCTTCCCGGT
ATGGGCGGCG TCTTCCGCAT CGCGCCGCCA TTGACGATTT CCGAGGAGGA GATCGAGCTT
GGCCTGCGCA TCATCACGCA ATCCATCGAA GCATCATTGG CCATCGAAGC GGCATTGCCG
CTCGGCGCAA GCCGTCAGGA CGTTGCGGCA GAATAG
 
Protein sequence
MSNNQDPAFW AAAGRHLIRY GGSFDPAIIE RARGSFVFDA DDRPILDFTS GQMSALVGHS 
HPRIVATVQR QMEKVAHLFS GMLSRPVVDL AERLAALAPG LDRVMLLSTG AESNEAAIRM
AKLVTGRHEI VAFSKSWHGM TGAASSATYS AGRKGYGPAM VGSLTIPAPN TFRPRFRHGD
GSLDWRTELD DAFALIDSQS TGSLAAFIAE PILSSGGLLE LPQGYLAALM EKCRERGMLL
ILDEAQTGIG RTGTMFAFQR DGVTPDILTL SKTIGAGLPL SAVMTTTEIE EAAHEKGFLF
YTTHVSDPLP AAVGLAVLDV VAEEGLVERA RHIGGELFDG LSQLKQRFDC VGDVRGRGLM
LGVEIVKPGE SRSADHELGS RIAAEAFRRG LSMNIVKLPG MGGVFRIAPP LTISEEEIEL
GLRIITQSIE ASLAIEAALP LGASRQDVAA E