Gene Gura_2114 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGura_2114 
Symbol 
ID5166289 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter uraniireducens Rf4 
KingdomBacteria 
Replicon accessionNC_009483 
Strand
Start bp2469483 
End bp2471027 
Gene Length1545 bp 
Protein Length514 aa 
Translation table11 
GC content58% 
IMG OID640549610 
Productpeptidase S10, serine carboxypeptidase 
Protein accessionYP_001230875 
Protein GI148264169 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2939] Carboxypeptidase C (cathepsin A) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000253475 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGATTCCGT CAATCCTTCT TGCCGCCGCG CTGATTACCG GAACCCCTTA TCATGCGTCA 
CATCCGGTGC CGGAGGCTGC TGTTGCTTCT GATGCGGCAA AGGGTGAAGA AAAACAGCCG
GAGAAGGACA AGAATGCGGC CGTCCCGGAA AAACCGGTGG TCACCAGGCA TAAGGTTGTC
GTGGAGAATA GGGAAATCGG GTACATGGTG ACCACAGGCC ATCTGCCGGT GATGAACGAT
GCCGGCGAAA GCGAAGCGCA GATCTTCTTT ATTGCCTATA CGGCTGACAA CCCATCCCCC
GGAATACGGC GGCCGCTCCT GTTCATCTTC AATGGCGGCC CGGGCGCGGC TTCGGTCTGG
CTTCACCTGG GCGCTGTCGG TCCCAGGCGC GTCCAGATGC TTCCTGACGG TAGGATGCCG
CCACCCCCTT ACCAGTTGGT GGATAACGAA TTCACCTGGC TGGATCAGGC CGATCTGGTC
TTCATCGATC CGGTCGGCAC CGGCTACAGT CGGGCGGTCA AGCCGGAGTT GACCAAGAAA
TTTGCCACGG TGCAGGGGGA CATCGATTCG GTTGGCAGAT TCATCAGGCT CTATCTGGCC
CGTTACGGGC GCTGGAATTC GCCGTTGTTC CTGGTGGGGG AGAGCTACGG TGCGTTCCGC
GCCGCCGGCC TTTCGGACTA CCTTTTTGAG CACGGCGCCG CCCTGAACGG GATCATCCTC
ATCTCTTCCG TCATGAACAT GCAGGCCATT TCGTTCGACC AGGGTAACGA TCTCCCCTAT
GAACTGTTCC TGCCCAGCTA CACGGCCACT GCCTGGTACC ATAAGAAACT CTCTCCGGAC
CTTCAGGGTG ATCTGGACAA GACGCTTGCG ACCGTTGAGA ACTGGGCTGC AACCGGGTAT
CTGACCGCCC TCGGCAAGGG AGATACTCTC TCTCCGGAAG AGAGGCGAAC GGTGGTCGAG
AAGCTATCCG CATTCACGGG GTTGGATAAA TCCTATATCG ACAACCGCAA CTTGCGCATC
GACAACAGGA GCTTTGTCAG AGACCTTTTG CGCGATCAGA GGCAAGTGGT CGGGTTTATG
GACAGCCGGT TCACGGCGGC GAACCTGGAC CCGGCGGCTC CTTTCGGGTT TGACCCGACC
GTAGCTACGA TCCGCGCCCC GTATACGGCC ACTTTCAACG ACTATGTCCG TCGTGAGCTC
GGATTCAAGT CTGACCTGGA ATACTTCACC TTGGGCGGAG GGATCGGACG TTGGGACTGG
GAGGCGAAAA ACGGTTACGC CGACAGCAGT GAGAATTTGC GCAATGCCTT TGCCAAAAAC
CCGTACATGA AGCTTTTCGT GGCATCGGGC TGCTTCGACC TGGCAACCCC GCATTTTTCC
ACGGAATATA CCATAAACCA CCTGGGTCTG ACCCCGGCCC TGAGGGGAAA CATAACAACC
CGTCGATACA GGGCAGGGCA CATGATGTAT CTGGACAGGA CGTCGCTTTC CCAGTTGAAA
AAGGATGTTG CGGCGTTTAT CGCAGGTGCT CTGGTAGAGC GATGA
 
Protein sequence
MIPSILLAAA LITGTPYHAS HPVPEAAVAS DAAKGEEKQP EKDKNAAVPE KPVVTRHKVV 
VENREIGYMV TTGHLPVMND AGESEAQIFF IAYTADNPSP GIRRPLLFIF NGGPGAASVW
LHLGAVGPRR VQMLPDGRMP PPPYQLVDNE FTWLDQADLV FIDPVGTGYS RAVKPELTKK
FATVQGDIDS VGRFIRLYLA RYGRWNSPLF LVGESYGAFR AAGLSDYLFE HGAALNGIIL
ISSVMNMQAI SFDQGNDLPY ELFLPSYTAT AWYHKKLSPD LQGDLDKTLA TVENWAATGY
LTALGKGDTL SPEERRTVVE KLSAFTGLDK SYIDNRNLRI DNRSFVRDLL RDQRQVVGFM
DSRFTAANLD PAAPFGFDPT VATIRAPYTA TFNDYVRREL GFKSDLEYFT LGGGIGRWDW
EAKNGYADSS ENLRNAFAKN PYMKLFVASG CFDLATPHFS TEYTINHLGL TPALRGNITT
RRYRAGHMMY LDRTSLSQLK KDVAAFIAGA LVER