Gene Glov_0043 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGlov_0043 
Symbol 
ID6365869 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter lovleyi SZ 
KingdomBacteria 
Replicon accessionNC_010814 
Strand
Start bp42982 
End bp45036 
Gene Length2055 bp 
Protein Length684 aa 
Translation table11 
GC content58% 
IMG OID642675444 
ProductTonB-dependent receptor 
Protein accessionYP_001950301 
Protein GI189423124 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGCGCC CGCCCCGCAG ACGGAAAGCA AAGCTGCTCA ACTGGCTGGT TGCAAGCCTG 
CTGACCCTGT TACCCGCTGT TTCTCCGGCA GCGGAACCGC CCGGCGAGTT GCTTCCCCTG
CCTTCAGACC CTCGCGAAGT TTTCCGCCGA TCCGGGATCA ATTCATTCGA CAAGGGTGCC
CTTTCCAACA TTGTCATTCG CGGCTACCAG CGCGAAAACC TGATGATCAC CTTTGACGGA
GCACCCTATT TTGGTGCCAC CCCCTTTAGA AGCGACGCAC CTCCTTTTAT CGTCAACAAT
AGTGAAGTCG GCAGGATTAC CATTACCAAA GGTCCCTATA ACCTTGCCTA CCCCGGCGGA
GCGGGTGGCA GCATAGAAGT ACTTTCCCCG GAAAACCCCA GACGTTTTTC AGCCGGGGGT
TCACTTTCCT ATGGTTCCTA TGATGCCTTA AACGGCAGTG CTTTTTTAGC GGTCGGCAAC
CAGCAGGCAG ACTTCAGTGC CGGCTACCGG GGCCGGAGCT CCGGTGTGCC GGAGGCTGGC
GGCGGCGTCC CGCTGGTCCG CACCCCGTAC CCCAACCCCA ACAACAACTA TCGTATTGGT
ACTGAAGATC TGCCCATGTA CCGCCTGGAT AGCTTTTGGC TTAAAGGCGG GATCAGTCCC
ACCAGCAATA ATCGCCTTGA ACTATCCTAC TCCTTTATGC AGGGCAGTGA GATCAAATTT
CCCACCCAGA ACATCGATAT AGCAGACGAG CAGGTACACC GCCTGAATGG ACGACTTACC
CTGCGCAACC TCTCGCCCCT GGTCAGGGAG ATCAGTCTGC AGGGCTGGTG GAGCCGGGCC
CGGACCCTGC TGGATGATTC GCTGCGGGAG ACCTCGGATG CCACGAATAC TGCGCTGCCT
TATCGTGCCT TCTTGAGCCG GGGCTATGCC ACGTCCAATC GTTTTGAGGT GACCTCCACC
GGTGGCAGAC TGACATCACA GCTTGCCCTC GGGCCGGGCA TACTCAAAAA GGGGCTTGAC
GTTTACCAGC GAGACTGGAA CGGCAGCTAT GCGGCGCTCT TGAGGCAGGG GGCTGCAGCC
TGGCAGTACT ACGACAACCA GCCACTGCTG CCGGATGTGA TGACCCGCAA CCTGGGTATG
TTCTGGATCT ATGAAACCCC GCTCAGCGAC ACGATGCGGG CACTCATTTC TGCCCGGGGT
GATTTCAGCC GGGTTGATGC CAATGGACTG ACCCCGGACC GCATCCGGAC ACTCTACCAG
CCCTACTATC CCGGACAGGG GATTCCGGCT GGACGGGATT TTGCCGACTG GAGCGCCAAT
GCCCAGCTAT TCTGGAAGAT CAGACCGGAT CTGGAGCTGT TTCTCAAGGG TGGGCGGGCT
GTCAGAATCC CCGATGCCAG CGAACTGTAT ATGGGGCAGA CCAGACAGGG CAGCAATGTC
GTCAGCAATC CGTTTTTGCA GCAGACCGTG GTTAACCAGA TCGATACCGG TGTCAGCTGG
GCACGGGGCG GGCAGCAGGT TGAAGCAACC TTTTTCTATG GCGAGGCAAC CAACTTTATT
TTACCGGTAA AACGTCTCAG CAGCAGCCTG CCGCAGGCAC GCAGCACCAC CAACCTGAAT
GCCGTCATCT GGGGCGTGGA ATGCGAAGGT ATCGTGCAGC TACCGGCTGA CCTGAAATTT
TCTGCCATGC TTTCCTACAG TGAAGGCGAA AACCGGAGCA GTAACAGGCC GCTGGCGGAG
GTGCCGCCTT TACGCGGCCG GCTGGGACTG GCCTACGACA ACCGCCGTTT CTTTGCCTCT
ATCAACCAGA CCCTGGTGGC CCGGCAAAAC CGTTTTGATG CGACCCTGAA CGAGACCTCA
ATTCCCGGCT ACGCCGTTAC TGACCTGCAG GCAGGGTGTC GCTACAACGG TTTTACCTTG
ACAGCCAAAC TTAACAATCT CTTTGACACC CGCTATGTCA TGCCGCTCTA CTACCAGCGC
GACCCGCTCA GCCCGACAGC ACGTATCCCT GAAAACGGGC GGAATTTTAC CCTGTCAGCC
AGTTACCGTT TCTAG
 
Protein sequence
MQRPPRRRKA KLLNWLVASL LTLLPAVSPA AEPPGELLPL PSDPREVFRR SGINSFDKGA 
LSNIVIRGYQ RENLMITFDG APYFGATPFR SDAPPFIVNN SEVGRITITK GPYNLAYPGG
AGGSIEVLSP ENPRRFSAGG SLSYGSYDAL NGSAFLAVGN QQADFSAGYR GRSSGVPEAG
GGVPLVRTPY PNPNNNYRIG TEDLPMYRLD SFWLKGGISP TSNNRLELSY SFMQGSEIKF
PTQNIDIADE QVHRLNGRLT LRNLSPLVRE ISLQGWWSRA RTLLDDSLRE TSDATNTALP
YRAFLSRGYA TSNRFEVTST GGRLTSQLAL GPGILKKGLD VYQRDWNGSY AALLRQGAAA
WQYYDNQPLL PDVMTRNLGM FWIYETPLSD TMRALISARG DFSRVDANGL TPDRIRTLYQ
PYYPGQGIPA GRDFADWSAN AQLFWKIRPD LELFLKGGRA VRIPDASELY MGQTRQGSNV
VSNPFLQQTV VNQIDTGVSW ARGGQQVEAT FFYGEATNFI LPVKRLSSSL PQARSTTNLN
AVIWGVECEG IVQLPADLKF SAMLSYSEGE NRSSNRPLAE VPPLRGRLGL AYDNRRFFAS
INQTLVARQN RFDATLNETS IPGYAVTDLQ AGCRYNGFTL TAKLNNLFDT RYVMPLYYQR
DPLSPTARIP ENGRNFTLSA SYRF