Gene Gura_3010 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGura_3010 
Symbol 
ID5165676 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter uraniireducens Rf4 
KingdomBacteria 
Replicon accessionNC_009483 
Strand
Start bp3478416 
End bp3480596 
Gene Length2181 bp 
Protein Length726 aa 
Translation table11 
GC content57% 
IMG OID640550505 
Productpeptidase S8/S53 subtilisin kexin sedolisin 
Protein accessionYP_001231755 
Protein GI148265049 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1404] Subtilisin-like serine proteases 
TIGRFAM ID[TIGR02543] Listeria/Bacterioides repeat 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGATCCT TGCGCGGCAG CAAAAAATTT CACATGACGG TGCGGGTGGC AACCTTCCTG 
GCCGTATTAG GCGTCTCCGT GCATGCCTCT CTGGCAGCCA CCGACAGCTA TGCCATCAAC
AGCCGGCAGC AGTTTATTGA CAGGGGTAAA CTCCATGCCG GCGTCAAAAA CAGGGTGGAG
CAAACCGGCG AGGCAGACAT AATCCTCGTT CTCAACGACA TCGATGTGAT CAGCATGGCC
GACCGGATAA AAGAACGGCT TGAAATCAGG AACGACAGTC CTGAAATCAC GGCCGAAAAG
TCGAGGTTAT TCACCAATAA GAAGAAGCAG GCACTTACGG CACTATCCCC TGCACATTAT
CAGCTATTGC AGGATTATGA CCAGCTGCCG GTCATGCACC TGAAGGTGGA CGCAGAGGCG
TTAGATGCCC TGCTACAGCT GGATGAGGTG GTGCTCGTCA ACGAAGACCG GGCTTTTGTT
CCACATTTAT CGGCGAGCCT GCCACTCATC GGAGCCCTCG AGGCCCACGC TGCGGGAGCC
ACCGGGAGCG GCACATCGGT GGCAGTACTG GACACGGGCG TCAACTACAC GCTCGCTGCT
TTCGGCAGCT GCACTGCGCC AGGCATCCCT GCTGGCTGCA AAGTAGCATA TACCCAGGAT
TTCGCGGCCG ATGACGGGAG CCTGGACGAT AACGGCCATG GCACCAATGT GTCCGGGATT
ATTGTCGGTG TCGCCCCGGA TACGAAGATT ATCGGGCTCG ATGTCTTCCG AACCGACGGT
TATGCCTATG ATTCCGACCT GCTGGCGGCA CTGAACTGGG TACTGGCCAA TCGGACAACC
TACAACATTG CCGCGGTAAA CATGAGCCTG GGAGGGGGCA AGTATACATC AGGATGCCCC
ACTGACTCCC TCGCCTCAGC AATCAACAAC CTGAAAAGCG CCGGGGTGGT GAGCGCAATT
TCTTCAGGAA ATGACGGCTA TACGAACGCG ATTTCCTCGC CGGCATGTAT CCCTGCGGCC
GTTTCGGTCG GAGCCGTCTA CGATGCAAAT GTCGGCAGCA GGAACTGGAC CGTATGCAGC
GATCTGACCA CCACGGCGGA CAAGGTTACG TGCTTTTCCA ATAGCGCCTC ATTTCTGACC
ATACTCGCCC CTGGCGCCCT GATCAGTGCT GCAAATATCA CCATGGCGGG TACGTCCCAG
GCGTCGCCCC ATGTTTCGGG CGCAGTTGCC CTGATCAGGG GCCAACATGC ATCTCTCACC
GCGACAGAGA TCGTCAACAA GCTGTCGAGT ACCGGCGTAT CCGTTACCTA CAATAGCATC
ACCAAACCCC GTTTAAATGT GGCCGCAGCG CTACGATTTC CTCAGATAGC GACCCAGCCG
GCCTCCGTCG CTTTCGGCTA CCTCCTGATC GGCACAGGCG AACCCGCCCA AACCTTTACC
ATCACCAATA CCGGCGCACT GGATCTCTCC GTCGGCACCA TCGCCTTGAC GGGTGCCGTT
GCAGACTTTG TCATGCAAAG TGATTATTGC TCCGGGCAAA CCCTCACGCC GGCGGCAAGC
TGCAGCGTCA GCGTAAAATT TTCTCCCCAG TCGGCCGGTT TCAAAACCGC AGCTCTTTCC
ATCCCATCAA ATGATCCTGA TCAACCCTCT CTGGCTCTTC CTTTGAGCGG TACAGCCGGC
ATGAGCTACA CCCTGACAAC AGCTAAAGCC GGCTCGGGGA CCATTACCAG TGCTCCGGCC
GGCATCAGCT GCGGTGGAGA CTGCACTGAA CTGTATCCTC AAGGTGCGGC AGTAACGCTG
ACAGCCTTGC CAGACCCGGG TTGGGTCTTT GCCGGCTGGT CCGGGAGCGG CTGCAGCAAC
GGGCCCTGCG TGGTGACATT AAATGGCGAT TTCAGCATCA CGGCGCTCTT CAGTCTTCTG
CAGCCGGTCA AACTTTCTTC TGTTTCAGGA TCAGGTTATC CCACCATTCA ATCCGCCTAT
GATGCCGCAG CGCAGGTGGA CGCCATACAA ATCCTCGCAC AAACCTTCAC TGAAAATATC
TTCCTGAACC GGCCGGTCTC GGTTATCCTG ACTGGTGGCT ACGACAGCAC CTATACCAGT
CCCCAGGGGC TTACCACCAT TAATGGCACG ATGACCATCA GCGACGGTAC GGTAACGGTC
GCAAATCTGG TTATACTGTA A
 
Protein sequence
MGSLRGSKKF HMTVRVATFL AVLGVSVHAS LAATDSYAIN SRQQFIDRGK LHAGVKNRVE 
QTGEADIILV LNDIDVISMA DRIKERLEIR NDSPEITAEK SRLFTNKKKQ ALTALSPAHY
QLLQDYDQLP VMHLKVDAEA LDALLQLDEV VLVNEDRAFV PHLSASLPLI GALEAHAAGA
TGSGTSVAVL DTGVNYTLAA FGSCTAPGIP AGCKVAYTQD FAADDGSLDD NGHGTNVSGI
IVGVAPDTKI IGLDVFRTDG YAYDSDLLAA LNWVLANRTT YNIAAVNMSL GGGKYTSGCP
TDSLASAINN LKSAGVVSAI SSGNDGYTNA ISSPACIPAA VSVGAVYDAN VGSRNWTVCS
DLTTTADKVT CFSNSASFLT ILAPGALISA ANITMAGTSQ ASPHVSGAVA LIRGQHASLT
ATEIVNKLSS TGVSVTYNSI TKPRLNVAAA LRFPQIATQP ASVAFGYLLI GTGEPAQTFT
ITNTGALDLS VGTIALTGAV ADFVMQSDYC SGQTLTPAAS CSVSVKFSPQ SAGFKTAALS
IPSNDPDQPS LALPLSGTAG MSYTLTTAKA GSGTITSAPA GISCGGDCTE LYPQGAAVTL
TALPDPGWVF AGWSGSGCSN GPCVVTLNGD FSITALFSLL QPVKLSSVSG SGYPTIQSAY
DAAAQVDAIQ ILAQTFTENI FLNRPVSVIL TGGYDSTYTS PQGLTTINGT MTISDGTVTV
ANLVIL