Gene Glov_0844 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGlov_0844 
Symbol 
ID6366992 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter lovleyi SZ 
KingdomBacteria 
Replicon accessionNC_010814 
Strand
Start bp869696 
End bp870856 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content47% 
IMG OID642676239 
Productrestriction endonuclease S subunits-like protein 
Protein accessionYP_001951088 
Protein GI189423911 
COG category[V] Defense mechanisms 
COG ID[COG0732] Restriction endonuclease S subunits 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTAGTA ACCCTATAAA GCCCGGCTGG CAGAAGGTCA AATTCGGGGA TGTCGTGCGC 
CTTTCGAAAG AGCGCAGCAG CGATCCGCTT GCCGATGGTT ATGAGCGGTA CATCGGTCTG
GAGCACATTG ACCCGGAGGA CTTGCGCGTC CGGCGCTGGG GCAATGTTGC CGATGGTGTC
ACCTTTACCA GCGTGTTCAA GCCGGGACAG GTTCTGTTCG GCAAACGTCG GGCATATCAA
CGCAAGGTAG CGGTGGCCGA TTTTGCCGGA GTCTGTTCAG GCGACATTTA TGTGCTGGAG
AGCAAAGACC CGAAGAAACT CCTTCCGGAA CTACTGCCGT TCATCTGCCA GACTGAAGCA
TTCTTTCAGC ATGCAGTCGG GACATCGGCA GGCAGCCTGA GTCCGCGAAC CAACTGGACA
AGTCTTGCTG ATTTTGAGTT TGCGCTGCCG CCGTTGGAGG AGCAGCGGCG GATTGTCGAG
TTACTTCTGG CTGTTGAGGA AACGATAGAT AACCTTGTCA GCGCAAGATC ATCTGCTCAG
CTATTATTCA AAGCAGCTTT GTTGGAAAGC TTCAATAGCC TCCCTGAAAA CAACAAAAAA
AAGATCGCTG ATTGTTATGA AATTCAGCTT GGGAAAATGA GCTCGGAAAA AGCTCGTTTT
GGTTCCAATC AAAAAACCTA TATAAAAAAC AATAATGTGC TTTGGGGGAA ATTTGACTTT
GGCGAATTGC CTCAAATGTC ATTTGATGAA CGAGAAATCA CAAAATATGA ACTGAGAAAG
GGAGACCTTC TTGTATGTGA AGGCGGTGAA ATAGGCCGTG CAGCCATTTG GCAAGATGAA
ATCCCTGGGA TGTTGTATCA AAAGGCATTG CACAGGCTGA GGCCACGAAC ATCTGATGAT
ATCCCGGAAT TTATGTTTCA TTATCTTCGC TATTGTGCAG AGAGAGGTAT TTTAGACGGA
GTTGCCACAG GAACAACTAT TCGACATCTC CCTGTAGAAC AACTTAGTCA ACTTGCTCTA
CCATTTCCCA AGCGAGCTGT TCAGGAGCAG GTGGCAAGTT TACTTTCAAA AATTGAATCA
GGAAATTCAA TGCTTGACGC CAAGATATGT CATTCAAGAA GTCTAAAATC TGCGGTGTTA
CGTCAGATTG CAGGAGGATA A
 
Protein sequence
MSSNPIKPGW QKVKFGDVVR LSKERSSDPL ADGYERYIGL EHIDPEDLRV RRWGNVADGV 
TFTSVFKPGQ VLFGKRRAYQ RKVAVADFAG VCSGDIYVLE SKDPKKLLPE LLPFICQTEA
FFQHAVGTSA GSLSPRTNWT SLADFEFALP PLEEQRRIVE LLLAVEETID NLVSARSSAQ
LLFKAALLES FNSLPENNKK KIADCYEIQL GKMSSEKARF GSNQKTYIKN NNVLWGKFDF
GELPQMSFDE REITKYELRK GDLLVCEGGE IGRAAIWQDE IPGMLYQKAL HRLRPRTSDD
IPEFMFHYLR YCAERGILDG VATGTTIRHL PVEQLSQLAL PFPKRAVQEQ VASLLSKIES
GNSMLDAKIC HSRSLKSAVL RQIAGG