Gene GSU1790 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU1790 
SymbolloN-2 
ID2686408 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp1954021 
End bp1956444 
Gene Length2424 bp 
Protein Length807 aa 
Translation table11 
GC content54% 
IMG OID637126473 
ProductATP-dependent protease La 
Protein accessionNP_952840 
Protein GI39996889 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0466] ATP-dependent Lon protease, bacterial type 
TIGRFAM ID[TIGR00763] ATP-dependent protease La 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGTCG ACCATAGTAT AGAAAGCCTG GAAAGAGGGG ATCTGCAAAG ATTCCCTCTT 
TTACCGTTAA GGGACATTGT TGTTTTTCCG CACATGGTAG TTCCTCTTTT TGTGGGGCGA
GAGCGTTCGA TCTCTGCCCT TGAGGCGGCC ATGAACGGCA ACAGGATGAT TTTTCTCGCC
GCACAGAGAA ATGCCAAAAC GGAAGATCCC CGCCAGGAGG ATATCTACAC AACTGGTACC
ATCTCACAGA TCATACAACT GCTGAAGCTC CCCGACGGTA CAGTCAAGGT GTTGGTGGAG
GGCAAGCAGC GGGGCAGTCT TGTCTCCTTT CTACCTAACC CCGACTACTT CATGGCTGAA
ATACAGCCGT ATCTTGAGTC TCTTGAAGCA AACCCCGAAC TTGAAGCCCT TATTCGGAGC
ACCAAATCGG TGTTTGAAGG CTATGTGAAG CTGACCAAGG GCATTCCGCA GGAAGTGGTC
AGTGCTGTTG GGGGAATTGT TGAGCCCGGC AAGCTTGCCG ATACGCTGGC ACCCCATCTT
AATCTCAAGC TTTCCGACAA GCAGCTGCTA CTTGGTATAA TGACTCCCCA TGAGCGACTG
GAGAAGCTCT TGTCTTTCAT GGAGGCCGAA TTGGAGATTC TCCAACTGGA GAATAAAATC
CGGACCCGTG TCAAGAAGCA GATGGAGAAG AACCAGAAAG AATATTATCT GAACGAGCAG
ATGCGAGCCA TCCAGAAGGA GTTGGGAACC AAAGACGACT TCAGGCAGGA ACTCCTTGAA
CTGGAGAACA AGGCAACCAA AGGGAAGTTG TCCAAGGAAG CCCGACAAAA GGCCCTTGCA
GAACTCAAAA AACTGAAGCT CATGTCGCCT GCGTCTGCCG AAGCTGCTGT GGTGAGAAAC
TACGTCGACT GGCTTGTATC ACTTCCGTGG GCGAAGATGA CTCGCGAAAA GCATGACATC
ATCGGCGCTG AAGAGGTTTT GAACGCTGAC CATTACGGTC TTGAAAAGGT CAAGGAGCGT
ATATTGGAGT ATCTCTCCGT CCAGGCGCTC GTTAAGAAGT TGAAGGGACC TATTCTCTGT
CTGGTTGGTC CCCCCGGGGT GGGTAAAACT TCACTTGCCC GATCAATCGC CACGGCAACC
GGTCGTCATT TTGTCAAAAT GTCGCTCGGT GGAATGCGTG ACGAAGCGGA GATCCGGGGC
CACCGCCGTA CCTATGTTGG AGCCATGCCC GGCAAGATCA TCCAGAACCT CAAAAAGGCG
GGAAGTAACA ACCCTGTATT TTTGCTGGAC GAGATTGACA AAATGAGCTC CGATTTCAGG
GGTGATCCGG CGTCGGCACT GCTTGAGGTT CTCGACCCGG AGCAGAATGC CCATTTCAGC
GATCATTTTC TTGATGTGGA GTATGATCTT TCCCACGTCA TGTTCATTGC CACGGCCAAC
TCGACGCATT CGATTCCCCG TCCCCTGCTC GACCGGATGG AGGTGATTCG CCTTGAGGGC
TACACCGAGC ATGAAAAACT TGCTATTGCC GAGCGCTATC TCGTTGGTAA ACAGATGGCA
GCCAATGGGC TTTCTGAACA ACAGATCAGC ATCTCCGGAA AGGCGGTCAG TGAAATAATT
CGCTATTACA CCCGCGAGGC TGGCGTGCGC AATCTGGAGC GCGATATTGC GACGCTCTGT
CGCAAGGCGG CTCATCGTGT GGTCAAGGGC GAGAAAAAGA AGATTGTAAT TCAGCCCAAG
AATCTGTCTG GGTTCCTGGG GCCTCGCAAG TACCGGATCG GGACGGCAGA GGACAAAGAT
GCTCCCGGCT ACGCAACGGG TCTTGCCTGG ACCGAAGTCG GCGGGGACCT GCTTACCATT
GAGGTGGCGA TTGTGCCCGG CACCGGAAAG CTCCTCATTA CCGGGAAGCT GGGGGAGGTC
ATGCAGGAGT CGGCACAGGC GGCCATGACC TATGTGCGTT CGCGGGCGCA GATCCTGGGT
ATTGATAAAG AGTTTCACAA GAAGGCGGAC ATACATATTC ACGTGCCGGA AGGAGCAATC
CCCAAGGATG GCCCCTCGGC CGGCATTACC ATGGCTACGG CGATAGTGTC TGCTCTAACT
GGCCGGCCGG TTCGGCACGA TCTTGCCATG ACGGGTGAGA TTACTCTCCG CGGTAATGTG
CTTGCCATCG GGGGCCTCAA GGAAAAACTC CTGGCCGCGG GGCGAGGTGG AATTTCCACG
GTTGTCATCC CAGAGGAAAA TAAAAAGGAT CTGGCGGAAA TCCCACGGGA GGTTACGGTG
GGGCTTACTA TCGTTCCAGC CCGTCATATG GACGAAGTGC TTTCCCGGGC GCTCTTGGCT
GAAGGCGTTT CAAGTGGAGC TCCCTACCTC GCCTCGGAGG GGCATCCGGC GATATCTGAA
CAGGAAACCG TTACTGCCCA CTGA
 
Protein sequence
MSVDHSIESL ERGDLQRFPL LPLRDIVVFP HMVVPLFVGR ERSISALEAA MNGNRMIFLA 
AQRNAKTEDP RQEDIYTTGT ISQIIQLLKL PDGTVKVLVE GKQRGSLVSF LPNPDYFMAE
IQPYLESLEA NPELEALIRS TKSVFEGYVK LTKGIPQEVV SAVGGIVEPG KLADTLAPHL
NLKLSDKQLL LGIMTPHERL EKLLSFMEAE LEILQLENKI RTRVKKQMEK NQKEYYLNEQ
MRAIQKELGT KDDFRQELLE LENKATKGKL SKEARQKALA ELKKLKLMSP ASAEAAVVRN
YVDWLVSLPW AKMTREKHDI IGAEEVLNAD HYGLEKVKER ILEYLSVQAL VKKLKGPILC
LVGPPGVGKT SLARSIATAT GRHFVKMSLG GMRDEAEIRG HRRTYVGAMP GKIIQNLKKA
GSNNPVFLLD EIDKMSSDFR GDPASALLEV LDPEQNAHFS DHFLDVEYDL SHVMFIATAN
STHSIPRPLL DRMEVIRLEG YTEHEKLAIA ERYLVGKQMA ANGLSEQQIS ISGKAVSEII
RYYTREAGVR NLERDIATLC RKAAHRVVKG EKKKIVIQPK NLSGFLGPRK YRIGTAEDKD
APGYATGLAW TEVGGDLLTI EVAIVPGTGK LLITGKLGEV MQESAQAAMT YVRSRAQILG
IDKEFHKKAD IHIHVPEGAI PKDGPSAGIT MATAIVSALT GRPVRHDLAM TGEITLRGNV
LAIGGLKEKL LAAGRGGIST VVIPEENKKD LAEIPREVTV GLTIVPARHM DEVLSRALLA
EGVSSGAPYL ASEGHPAISE QETVTAH