Gene Gmet_1201 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGmet_1201 
Symbol 
ID3738741 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter metallireducens GS-15 
KingdomBacteria 
Replicon accessionNC_007517 
Strand
Start bp1348163 
End bp1349710 
Gene Length1548 bp 
Protein Length515 aa 
Translation table11 
GC content54% 
IMG OID637778481 
Productpentapeptide repeat-containing protein 
Protein accessionYP_384163 
Protein GI78222416 
COG category[S] Function unknown 
COG ID[COG1357] Uncharacterized low-complexity proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.0514272 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value0.288763 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATCCGT CCAATCCAGC ATTTTCAGGC GACACTGAGC ATCTCAGCGT CCTGCTAAGG 
GCAGTTGAAC AGTGCGACAT ATCAATCTGG AACAACTGGA GAACGACCAA TCCAGCAATA
CGCCCCGCCC TGACAGGTGC TACCCTTGAA GGAGCAGTTC TGTTCGATGC GAATCTGGCA
GAGTCAAACA TCGGGAACGC AAATCTTCGC AACTGCCGGA TGATGGGGAC CAATCTGCGG
GGCGCCATCC TTGCCGGCAC TGATTTGGAC GGTGCCAACC TTTTCGGTGC AAACTGCCAG
GGAGCAAATT TCTGCAACAG CAGTCTTCGG GGAGTAACCC TTCGCTGCGG CATGCAAGGG
TGCATTCTCG AAGGTGCGAA CCTTGAGCGC GCCGATATGA TCGGGACCGA TTTCACCGGT
GCACGACTGG CAGGAGCATC GTTCCGAAAT GCTGAAGCCT TTGGCGCCCG GTTCCGGAAC
GCGATCCTCA CAGGTGCTGA CCTATCAGGA ATCAACTTGG GCACTGCAGA CCTGTTCAAG
GCAGAGTGCA ACGAAGCGGT GTTTTCCGAT GCCAGCCTCC TCAGCGCAAC GTTTAGCGAG
GCGAGCCTCC GCCATGCATC CTTCTGCCGG GCAGACCTCT CAAATTCCTA TCTCAATAGC
GCAACACTCA TCGCCGCCGA TCTCAGCGGA GCTAAACTTG CTTTCGCGAA CCTCATGGAA
GCCGATTTGC GCGACGCTAT CTTCCGAGGA ACGAATCTCC AGGGGGCTGT GCTCATGAAT
GCGAAACTGA CTGGGGCCGA CCTTCGCAAC GTCAACATGA GTCTATCTAA CCTCACAATG
GCCGATCTTG CGGAGGCCAA CCTCACCAAT GCAAATCTGA CGGGCGCCCT AATGGTCGGC
ACCGATCTGA GACGCGCCAC CCTCAGGGGA TGCTTGGTCT TCGGGATTTC ACCGTGGGAG
GTACTCGCGG AGGATGCCGA CCAACAGGAT CTGGTAGTCA CTAAGTTGAA CCACCCGCAG
GTTACGGTAG ACGACATCGA GATGGCCCAG TTTATCTATT TGCTTCTCAA TAACGCCAAG
GTACGCCGAG CTGTAGACAC TATAACCTCG AAGATTGTAC TCATCCTCGG CCGTTTCTCT
CCAGAGCGAA AATCGGTGCT CGATGCCGTG CGAGATCAGC TTAGAGCCTG CAACTACGTA
CCTGTACTCT TTGACTTCAA TCGGCCAGAG TCCCTCGATG AAGTGGAAAC CGTTACACTC
CTTGCACGGA TGGCCAGATT CATTATTGCC GACCTGACGG AGCCTCACTG CGTTCCCGAT
GAACTACGCT CAATCGTTCC CGATGTAGAA GTGCCTGTTC TGCCAATAAT CGAAGGAGAT
AACCCCTACG GAACATTTGG AACTCTCCGC AAATATCCGT GGGTGCTGGA CATACATCGA
TATCGGGGCA TTGAAGACCT GCTTGCCTCC TTTGACGAAT CGGTCGTCGT CCCTGCCGAG
AACATGACTC GGGAAATTGG AAAGCGTAAA GCACTTCGAA ATAATTGA
 
Protein sequence
MNPSNPAFSG DTEHLSVLLR AVEQCDISIW NNWRTTNPAI RPALTGATLE GAVLFDANLA 
ESNIGNANLR NCRMMGTNLR GAILAGTDLD GANLFGANCQ GANFCNSSLR GVTLRCGMQG
CILEGANLER ADMIGTDFTG ARLAGASFRN AEAFGARFRN AILTGADLSG INLGTADLFK
AECNEAVFSD ASLLSATFSE ASLRHASFCR ADLSNSYLNS ATLIAADLSG AKLAFANLME
ADLRDAIFRG TNLQGAVLMN AKLTGADLRN VNMSLSNLTM ADLAEANLTN ANLTGALMVG
TDLRRATLRG CLVFGISPWE VLAEDADQQD LVVTKLNHPQ VTVDDIEMAQ FIYLLLNNAK
VRRAVDTITS KIVLILGRFS PERKSVLDAV RDQLRACNYV PVLFDFNRPE SLDEVETVTL
LARMARFIIA DLTEPHCVPD ELRSIVPDVE VPVLPIIEGD NPYGTFGTLR KYPWVLDIHR
YRGIEDLLAS FDESVVVPAE NMTREIGKRK ALRNN