Gene GSU1772 details


Gene Information

Locus tag: GSU1772
Symbol: ctpA-2
ID: 2686575
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sulfurreducens PCA
Kingdom: Bacteria
Replicon accession: NC_002939
Strand:
Start bp: 1935660
End bp: 1936991
Gene Length: 1332 bp
Protein Length: 443 aa
Translation table: 11
GC content: 53%
IMG OID: 637126452
Product: carboxy-terminal processing protease
Protein accession: NP_952822
Protein GI: 39996871
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0793] Periplasmic protease
TIGRFAM ID: [TIGR00225] C-terminal peptidase (prc)
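
The start, end, gene length and protein length fields above are related by simple arithmetic. The sketch below shows one way to cross-check them, assuming the coordinates are 1-based and inclusive and that the protein length excludes the stop codon (neither is stated on this page). The function names are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1935660, 1936991)
print(length)                  # 1332
print(protein_length(length))  # 443
```

Under those assumptions, both values agree with the Gene Length and Protein Length fields listed for GSU1772.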


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.011127
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTCAAGA CCATCAAAGG TAAACGCGTC GCACTGTTGC TTGCCTCGCT TTGTGTTGTT 
GCCGTGCTCG GTGCCGGTGC CGTTCAGAAG CGGTGCGCAG CTGAGGGAGG GAACGATTAC
GAGTCCATCG AGCTCTTCAC TGATGTGTTG GCGATCGTCA AGAAAAGCTA TGTTGAAGAG
GTGGACACCA AGAAGCTCAT CTACGGAGCC ATCAACGGTA TGCTTGCTTC ACTTGATCCA
CACAGCTCCT TCATGCCTCC CGACATGTAC AAGGAGATGA AGATCGATAC GAAGGGGTCT
TTCGGCGGCC TGGGTATCGA GATTACGATC AAGGATGGAC TCCTCACGGT AATATCCCCC
ATAGAGGACA CTCCTGCCTT CAAGGCCGGC ATCAAGGCGG GAGATCAAAT CTTGAAAATC
GAAGACCGCT TTACCAAGGA CATGACCATC ATGGATGCGG TCAAGAGAAT GCGGGGCCCC
AAGGGGACGA AAGTAACCCT TACCATTATG CGTGAAGGTT TCGACAAACC GAAGGAATTT
ACGCTCGTTC GCGATACCAT TCAGGTCAAG AGCGTGCGGT TCAAATCGAT GGATCAGGGG
TATGGTTACA TAAGAATCGC ACAGTTCCAG GAAAAGACGG ACGATGACCT GGTCAAGGCG
CTCAAGGCAC TCAAGGAAGA GAATGGCGGA GATTTGAGGG GACTCGTCCT CGACCTCCGT
AACGATCCGG GCGGACTTCT CGATCAGGCT GTCAAGGTTG CCGATCACTT TGTCGAAGAT
GGACTCATTG TGTACACGGA GGGGCGTGAG AAGGAGTCGC GGATGCAGTT TACCGCCCGC
AAGTCCGGCA CTGAACCCAA CTACCCGATG GTTGTGCTGA TCAACAGCGG AAGCGCCAGT
GCTTCTGAAA TTGTCGCTGG TGCGCTGCAG GATCATAAGC GTGCCGTTGT CATGGGGACC
CAGAGTTTCG GGAAAGGCTC GGTCCAGACA ATCATCCCCC TCTCCGATGA GTCTGGTCTC
CGACTCACCA CGGCACGGTA TTTCACGCCG AGCGGTCGTT CCATCCAAGC CAAGGGCATA
ACGCCGGACA TCGTTGTGGA GCGCGCGGAA ATCCAGTCTA CAGAGAAGAT GGAAGGCCAT
ATCCGCGAGA AAGACCTTGA GAATCATTTC GATTCCGACT CGAAGGACGG ATCGGACAAC
AAACAAAAAG GAACAGATAA AGGTGCTTCG GCAGCATCCA AGGTCGATGA GCAGTTGAAG
AGCGATTATC AGGTGATGCG CGCGCTGGAT CTCCTGAAAG GGTGGGAAAT CCTGAAAACA
ATAAGCAAAT GA
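
The GC content field reported for this gene can be recomputed from a sequence listed in 10-base blocks like the one above. This is a minimal sketch, assuming GC content means the percentage of G and C bases over the whole sequence. `gc_percent` is a hypothetical helper name, and the page does not say how its own figure is rounded.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First line of the gene sequence only; a single line is not expected
# to match the whole-gene figure.
first_line = "ATGTTCAAGA CCATCAAAGG TAAACGCGTC GCACTGTTGC TTGCCTCGCT TTGTGTTGTT"
print(f"{gc_percent(first_line):.1f}%")
```

To check the whole gene, pass the full listing as one string. The embedded spaces and line breaks are stripped before counting.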
 
Protein sequence
MFKTIKGKRV ALLLASLCVV AVLGAGAVQK RCAAEGGNDY ESIELFTDVL AIVKKSYVEE 
VDTKKLIYGA INGMLASLDP HSSFMPPDMY KEMKIDTKGS FGGLGIEITI KDGLLTVISP
IEDTPAFKAG IKAGDQILKI EDRFTKDMTI MDAVKRMRGP KGTKVTLTIM REGFDKPKEF
TLVRDTIQVK SVRFKSMDQG YGYIRIAQFQ EKTDDDLVKA LKALKEENGG DLRGLVLDLR
NDPGGLLDQA VKVADHFVED GLIVYTEGRE KESRMQFTAR KSGTEPNYPM VVLINSGSAS
ASEIVAGALQ DHKRAVVMGT QSFGKGSVQT IIPLSDESGL RLTTARYFTP SGRSIQAKGI
TPDIVVERAE IQSTEKMEGH IREKDLENHF DSDSKDGSDN KQKGTDKGAS AASKVDEQLK
SDYQVMRALD LLKGWEILKT ISK
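
The protein sequence can be derived from the gene sequence by codon-by-codon translation. The sketch below uses the standard codon-to-amino-acid assignments, which translation table 11 shares. Table 11 differs from the standard code in its permitted alternative start codons, which this sketch does not handle (this gene starts with ATG, so it is unaffected). `translate` and `CODON_TABLE` are hypothetical names chosen for illustration.

```python
from itertools import product

# Codon assignments in NCBI order (T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(sequence.split()).upper()
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 12 bases of the gene sequence listed above.
print(translate("ATGTTCAAGA CCATCAAAGG TAAACGCGTC"))  # MFKTIKGKRV
print(translate("ATAAGCAAAT GA"))                     # ISK
```

Both fragments reproduce the corresponding ends of the listed protein sequence, and the final TGA codon is read as the stop.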