Gene GM21_2226 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_2226 
SymbolpheS 
ID8137564 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp2597516 
End bp2598532 
Gene Length1017 bp 
Protein Length338 aa 
Translation table11 
GC content64% 
IMG OID644869840 
Productphenylalanyl-tRNA synthetase subunit alpha 
Protein accessionYP_003022033 
Protein GI253700844 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0016] Phenylalanyl-tRNA synthetase alpha subunit 
TIGRFAM ID[TIGR00468] phenylalanyl-tRNA synthetase, alpha subunit 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones51 
Fosmid unclonability p-value0.00150996 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAGGATA AACTGGAAGC ACTTTTGGAT CAGGCCCTCT CCGAGCTGGC GCAAGCCTCC 
ACCGAGGAGG GCGTGCAGGA GCTGCGGGTC AAGTACCTGG GGAAAAAGGG TGAGCTGACC
TCCGTGATGA AGGGGTTGGG CGCGCTCACC CCGGAGGAAC GCCCCATCAT CGGCCAGGTG
GTGAACACGG TCAAGGGCAA GCTGGAAGAG GGGTTCGAGC TCCGCGGCGG CGAGATCCGC
GAGGCCGTGA AGAGCGCCCG GCTCTCCGCC GAGAGGATTG ACGTGACCCT TCCGGGCCGC
CGCCGGCCGC TGGGCTCCAA GCATCCCATC ACGCTCGTCA CCGAGGAGAT CGCCTCCATC
TTCGGCGCGC TCGGCTTCGC CGTAGCCGAA GGGCCCGAGA TCGAGCTCGA CTTCTATAAC
TTCGAGGCGC TCAACCTGCC GAAGGACCAT CCCGCCCGCG ACATGCAGGA TACCTTCTAC
TTCGGCGAGA GCGTCCTTCT GAGGACCCAC ACCTCTCCGG TGCAGATCCG CACCATGCTG
AAGCAGCCGC CGCCGGTCCG CATCATCGCC CCCGGCACCG TGTACCGCTG CGATTCCGAC
GCCACCCATT CGCCTATGTT CCACCAGGTC GAGGGGCTCA TGGTGGACAA GGGAATCACC
TTCGGCGACC TCAAGGGGAT CCTGACCCTC TTCATCAGCC AGCTCTTCGG TTCCGACATC
GGCGTGAGGC TGCGCCCCTC GTTCTTCCCG TTCACCGAGC CGTCGGCCGA GGTCGACATC
GCCTGCGTCA TCTGCCGCGG CAAAGGTTGC CGGGTCTGCA AGGAGACCGG CTGGCTCGAG
ATCCTGGGCG CCGGCATGGT CGACCCTGAG GTGTACCGCC ACGTGGGCTA CGACTCCGAG
CTCTACACCG GCTTCGCCTT CGGGATGGGT ATCGAGAGGA TAGCCATGCT GAAGTACGGC
ATAGCCGACA TGAGGCTCCT GTTCGAGAAC GACCTCAGGT TCCTGAAGCA GTTCTAA
 
Protein sequence
MKDKLEALLD QALSELAQAS TEEGVQELRV KYLGKKGELT SVMKGLGALT PEERPIIGQV 
VNTVKGKLEE GFELRGGEIR EAVKSARLSA ERIDVTLPGR RRPLGSKHPI TLVTEEIASI
FGALGFAVAE GPEIELDFYN FEALNLPKDH PARDMQDTFY FGESVLLRTH TSPVQIRTML
KQPPPVRIIA PGTVYRCDSD ATHSPMFHQV EGLMVDKGIT FGDLKGILTL FISQLFGSDI
GVRLRPSFFP FTEPSAEVDI ACVICRGKGC RVCKETGWLE ILGAGMVDPE VYRHVGYDSE
LYTGFAFGMG IERIAMLKYG IADMRLLFEN DLRFLKQF