Gene Gmet_1234 details


Gene Information

Locus tag: Gmet_1234
Symbol:
ID: 3738411
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter metallireducens GS-15
Kingdom: Bacteria
Replicon accession: NC_007517
Strand:
Start bp: 1392107
End bp: 1393723
Gene Length: 1617 bp
Protein Length: 538 aa
Translation table: 11
GC content: 62%
IMG OID: 637778514
Product: extracellular solute-binding protein
Protein accession: YP_384195
Protein GI: 78222448
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
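
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon is not counted."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(1392107, 1393723)
print(length)                  # 1617
print(protein_length(length))  # 538
```

Both results agree with the Gene Length and Protein Length fields of the record. The check does not depend on the strand, which the record leaves blank.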


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value: 0.871598
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 46
Fosmid unclonability p-value: 0.318753
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGGCAC GGCTGTTCTG CTGTCTTGCG GCACTGGTTA CCCTTGTTTC GTGCAGCGGC 
GGTACGCCTC CTGCGGAACT GCCGCGGCGT GGAGCAGGGG CCCCTGCCTA CGGCGATGCC
CTGGTGGTTG GCACCATTGG CGAACCATCC AACCTGATCC CGCTGCTTGC CTCCGATTCC
GCCTCCCACG ATGTGGCGGG GAACGTCTTC AACGGCCTTG TGAAGTATGA CAAGAATCTT
GAGCTGGTGG GGGATCTGGC GGAGTCGTGG CAGGTATCCC CCGACGGCCT CACCATCACC
TTCAGATTGC GCAAAGGGGT GAAGTGGCAC GACGGCACCG AGTTCACCTC CCGCGACGCC
CTCTACACCT ACCGCGTCAC CATCGATCCC AAGACCCCCA CTGCCTATGC CGAGGACTTC
AAGCAGGTGA AGAGCGCAGA AGCGCCGGAC CGGTACACCT TCCGCGTGAC TTACGGCAAG
CCCTTTGCCC CGGCTCTGGC CTCCTGGGGG GCGGGAATCC TGCCGGCCCA CCTCCTTGAG
GGGAAGGACA TCACGAAAAG CCCACTTTCC CGCCATCCCG TGGGGACCGG CCCCTACCGG
TTCAAGGAGT GGATTGCGGG GCAGAAGGTC GTCCTTGAGG CATTTCCCGA CTACTTCGAG
GGGCGGCCCT ACATTGACCG GTTCGTCTAT CGGATTATCC CCGATACTTC CACCATGTAC
CTGGAACTGA AGGCGGGCGG CCTCGACATG ATGGGGCTTA CGCCGGTCCA GTACGCCCGG
CAGACCGATA CCCCCGATTT CCTGGCCCGT TTCAATAAGT TCCGCTATCC GGCCTCTGCG
TACACCTACC TGGGATACAA TCTCCGCAAC CCTCTTTTCG CCGACCGGCG GGTCCGACAG
GCCATTGCCT GCGCCATCAA CAAGGACGAG ATAGTCCACG GGGTGCTTCT GGGGCTGGGG
CAGGTGGCCC ACGGGCCCTT CAAGCCGGGA ACCTGGCCAT ACAACCCTTC CGTGAGGGAC
TTCGGCTACG ACCCGGCCCG GGCAAAGGCG TTGCTGGCAG AGGCGGGGTG GCACGCGGTC
GGGCCCGATG GTATCCTTAC GAAAGACGGC AAGCCCTTCA GTTTCACGAT TTTTACCAAC
CAGGGAAACG ACCAGCGGCT CAAGACCGCC CAGATCATCC AGCGTCGCCT CGCAAAGGTT
GGGATCGAGG TGAAGATCCG GGTTCTGGAG TGGGCGTCGC TTCTCACCAA CTTCATCGAC
AAGCGGAATT TCGACGCCCT CATCATGGGA TGGACCATTC CCCAGGATCC GGATATCTTT
GATGTGTGGC ATTCCAGCAA GACCGGCCCC AAGGAACTGA ACTTCATCGG GTTCAAGAAC
GCCGAGGTGG ACCGGCTCAT CGAAGAGGGG CGCAGTACCT TCGATCAGGA AAAGCGCAGG
CGCTGCTACT GGCGCATCCA GGAAATACTC GCCCAGGAGC AGCCCTACAC GTTTCTCTTC
GTCCCCGATG CCCTGCCGGT CGTGAATGCC CGCTTCCGGG GGATCGAGCC AGCGCCGGCC
GGAATCATGC ACAACATCAT CCGGTGGTAT GTGCCCAAGG AAGAGCAGGT GCATTAA
 
Protein sequence
MTARLFCCLA ALVTLVSCSG GTPPAELPRR GAGAPAYGDA LVVGTIGEPS NLIPLLASDS 
ASHDVAGNVF NGLVKYDKNL ELVGDLAESW QVSPDGLTIT FRLRKGVKWH DGTEFTSRDA
LYTYRVTIDP KTPTAYAEDF KQVKSAEAPD RYTFRVTYGK PFAPALASWG AGILPAHLLE
GKDITKSPLS RHPVGTGPYR FKEWIAGQKV VLEAFPDYFE GRPYIDRFVY RIIPDTSTMY
LELKAGGLDM MGLTPVQYAR QTDTPDFLAR FNKFRYPASA YTYLGYNLRN PLFADRRVRQ
AIACAINKDE IVHGVLLGLG QVAHGPFKPG TWPYNPSVRD FGYDPARAKA LLAEAGWHAV
GPDGILTKDG KPFSFTIFTN QGNDQRLKTA QIIQRRLAKV GIEVKIRVLE WASLLTNFID
KRNFDALIMG WTIPQDPDIF DVWHSSKTGP KELNFIGFKN AEVDRLIEEG RSTFDQEKRR
RCYWRIQEIL AQEQPYTFLF VPDALPVVNA RFRGIEPAPA GIMHNIIRWY VPKEEQVH
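
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a sketch only, assuming the sequences are written in blocks of ten separated by whitespace as shown above. The codon assignments are those shared by the standard code and translation table 11. Table 11 differs in its permitted start codons, which does not matter here because this gene starts with ATG. The names `translate` and `gc_fraction` are hypothetical helpers, not functions from any library.

```python
BASES = "TCAG"
# Amino acids in NCBI codon order (first base T, C, A, G; then second; then third).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(dna: str) -> str:
    """Remove the whitespace used for ten-base blocks and upper-case the bases."""
    return "".join(dna.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    seq = clean(dna)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(dna)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three blocks of the gene sequence above.
print(translate("ATGACGGCAC GGCTGTTCTG CTGTCTTGCG"))  # MTARLFCCLA
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the GC content field.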