Gene GM21_0585 details


Gene Information

Locus tag: GM21_0585
Symbol:
ID: 8135900
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 719736
End bp: 720806
Gene Length: 1071 bp
Protein Length: 356 aa
Translation table: 11
GC content: 61%
IMG OID: 644868202
Product: branched-chain amino acid aminotransferase
Protein accession: YP_003020417
Protein GI: 253699228
COG category: [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism
COG ID: [COG0115] Branched-chain amino acid aminotransferase/4-amino-4-deoxychorismate lyase
TIGRFAM ID: [TIGR01123] branched-chain amino acid aminotransferase, group II
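
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only and assumes that the start and end positions are 1-based and inclusive, and that the final codon of this unspliced CDS is a stop codon that is not counted in the protein length. The variable names are hypothetical.

```python
# Values copied from the record above.
start_bp = 719736
end_bp = 720806

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is unspliced and ends with one stop codon,
# which does not contribute a residue to the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 1071 0 356
```

Under these assumptions the result agrees with the reported gene length of 1071 bp and protein length of 356 aa.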


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 55
Fosmid unclonability p-value: 0.00279686
Fosmid hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGAGATCA GGATTGTGCC CCTCAGCGAG GGAGAGAAGA AGCCGAAATT CACCGACGAA 
TCACAGCTTG GCTTTGGCAA GATTTTCACC GACCGCATGC TTTTGGTCGA GTGGAAAGTC
GGCCAGGGGT GGGTAGATGC CAGGATCAAG AAGTACGAAC CGTTCCTGCT CGATCCGGCG
GCGCTCGTGC TGCATTATGC CCAGGAAATC TTCGAAGGGC TCAAGGCCTA CAAGTGGAAG
GACGGCAGCA TCGCGCTGTT CCGCCCCGAG ATGAACGCCC GCCGCTTCAA CCACTCCGCT
GACCGTCTCT GCATGCCCGA GATCCCGGAG GAGCTCTTTG TGAGCGGCAT CGAGCAGTTG
GTCTCCGCCG AGCGCGACTG GGTTCCCGGC GCCGAAGGGA CTTCGCTCTA CATCCGCCCC
ACCATGATCG CGGTGGAGCC GCTGGTCGGC ATCAAGCCTT CGGACCACTA CTACTTCTAC
GTGATACTCT CCCCGGTCGG CGCCTATTAC GCCAACGGCT TCAATCCGGT GAAGATCATG
GTCGAGGACC ACTACGTCAG GGCCACCCCC GGCGGCACCG GCGAGGCCAA GACCGGCGGC
AACTACGCCA GCTCCCTCAA GGCGGGGCTC GAGGCCAAGA AGAAAGGTTT CGACCAGGTG
CTCTGGCTGG ACGGCGTGCA CAAGCGCTAC ATCGAGGAAG TAGGCTCGAT GAACATGTTC
TTCGCCTACG GCGACACCAT CGTCACCGCG CCGCTTGAAG GAAGCATCCT GAACGGCATC
ACCCGCGACT CCGTGCTGAC CCTGGCCAAG TCGTTGAACC TGAAGGTCGA GGAGCGGCGC
ATCGACGTGA AGGACCTGAT GGCGGATCTC AAAAGCGGGA AGATCACCGA GGCGTTCGGC
AGCGGCACCG CCGCGGTCGT CACCCCTGTG GGTACGCTGA GCTATCTCGG CGAATCGGTA
CAGGTAGGAA CCGGCGGCGT CGGCAAATAC ACCCAGGTGC TCTACGACAC GCTGACCGGA
ATCCAGACCG GCAAGATCGA AGACAAATTC GGCTGGATCA GGAAGATCTA G
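
The gene sequence is displayed in blocks of ten bases, so the spaces and line breaks have to be removed before any computation. The following is a minimal sketch of how a GC fraction could be computed from the displayed text. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and how the reported GC content figure was rounded is not stated in the record, so small differences are possible.

```python
def clean_sequence(text):
    """Remove spaces and line breaks, and uppercase the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First displayed line of the gene sequence, used here as sample input.
first_line = "ATGGAGATCA GGATTGTGCC CCTCAGCGAG GGAGAGAAGA AGCCGAAATT CACCGACGAA"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Passing the full gene sequence text to the same two helpers gives the GC fraction for the whole gene.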
 
Protein sequence
MEIRIVPLSE GEKKPKFTDE SQLGFGKIFT DRMLLVEWKV GQGWVDARIK KYEPFLLDPA 
ALVLHYAQEI FEGLKAYKWK DGSIALFRPE MNARRFNHSA DRLCMPEIPE ELFVSGIEQL
VSAERDWVPG AEGTSLYIRP TMIAVEPLVG IKPSDHYYFY VILSPVGAYY ANGFNPVKIM
VEDHYVRATP GGTGEAKTGG NYASSLKAGL EAKKKGFDQV LWLDGVHKRY IEEVGSMNMF
FAYGDTIVTA PLEGSILNGI TRDSVLTLAK SLNLKVEERR IDVKDLMADL KSGKITEAFG
SGTAAVVTPV GTLSYLGESV QVGTGGVGKY TQVLYDTLTG IQTGKIEDKF GWIRKI
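
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is an illustration, not the method used to produce this record. It assumes the standard amino acid assignments, which translation table 11 shares with the standard code (table 11 differs in the start codons it permits, a case this sketch does not handle). The name `translate` and the table layout are hypothetical choices for this example.

```python
# Codon table in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate DNA in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop display spaces and newlines
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence, as displayed.
print(translate("ATGGAGATCA GGATTGTGCC CCTCAGCGAG"))  # prints: MEIRIVPLSE
```

The output matches the first ten residues of the protein sequence above. The full gene sequence can be passed in the same way to compare against all 356 residues.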