Gene GM21_2973 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_2973 
Symbol 
ID8138316 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp3454263 
End bp3455366 
Gene Length1104 bp 
Protein Length367 aa 
Translation table11 
GC content62% 
IMG OID644870571 
Productpeptidase M29 aminopeptidase II 
Protein accessionYP_003022760 
Protein GI253701571 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2309] Leucyl aminopeptidase (aminopeptidase T) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value0.000000000872397 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAAGATC CTCGCATAAG ACAATTCGCT GAAGTCCTGG TCGATTACTC GGCACGCGTG 
CAGAAAGGCG ACGTGGTGCT CATCTCCTGC GCCGGCACCG AAGGCGCGCC TTTGGTCAAG
GAACTCTACG CGCTCTGCCT GGAGCGGGGG GCCAAGTACG TCGAGTACGA GTTCACCATT
CCCGACATCA ACCGCTACTT CTACAACCTA GCCAGCGCCG ACCAGCTCTC CTACTTCCCC
CAGCACAAGC TGGACTTCAT GAAGCAGGCG GACGTCTACA TCGGGCTCAC CGCGTCGGAC
AACTCGATGG TGATGGCTAA GGCGAAGCAG GCCAGCATGA TCGCCTGGTC CAAGGTGGTG
CGCCCCATCA TCGACCAGCG CGTCAAGCAC ACCCGCTGGG TCATCACCCG CTATCCGACC
CAGTCGGCGG CCCAGGAAGC GCGCATGAGC CTCGACGAGT ACGAGGATTA CCTCTTCGCG
GCCTGCTGCA TGGACTGGGA GGAGGAGTCG AGGAAGCAGG ACGCGCTCAA GGCGTGCGTG
GACGCGGCGG ACCGGGTGCG CATCAAGGCC TCCGACACGG ATCTTTGCTT CAGCATCAAG
GGGCTTCCCG GCATCAAGTG CGACGGGCGC CTCAACATCC CCGACGGCGA GGTCTTCACC
GCGCCGGTGC GCGACTCGGT GCAGGGATAC ATCACCTACA ACTGCCCCAC CGTGTACCAG
GGTAAGGAAT TCAACAACAT CCGCCTGGAG TTCGAGAACG GCCGCATCGT CCGCGCCAAC
TCGCCTGGAA TGGACGAGGA GCTGAACCGG ATCCTCGACA CCGACGACGG GGCGCGTTAC
GTCGGCGAAT TCGCCATCGG CGTCAACCCG AAGATCACGG TGCCTATGCG CAACATCCTT
TTCGACGAGA AGATCTTCGG CTCCATCCAC TTCACGCCCG GGCAGGCCTA CGACGAGTGC
GACAACGGCA ACCGCTCCGC GGTGCACTGG GACATGGTGA AGATACTGGC CGGCGACGGC
GAGCTTTGGT TCGACGAGAT CCTGATCCAG AAGGACGGAC TCTTCGTCCA CGAGCCCCTG
CTCGGTCTCA ACCCGGGAGC TTAG
 
Protein sequence
MKDPRIRQFA EVLVDYSARV QKGDVVLISC AGTEGAPLVK ELYALCLERG AKYVEYEFTI 
PDINRYFYNL ASADQLSYFP QHKLDFMKQA DVYIGLTASD NSMVMAKAKQ ASMIAWSKVV
RPIIDQRVKH TRWVITRYPT QSAAQEARMS LDEYEDYLFA ACCMDWEEES RKQDALKACV
DAADRVRIKA SDTDLCFSIK GLPGIKCDGR LNIPDGEVFT APVRDSVQGY ITYNCPTVYQ
GKEFNNIRLE FENGRIVRAN SPGMDEELNR ILDTDDGARY VGEFAIGVNP KITVPMRNIL
FDEKIFGSIH FTPGQAYDEC DNGNRSAVHW DMVKILAGDG ELWFDEILIQ KDGLFVHEPL
LGLNPGA