Gene GSU3233 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU3233 
Symbol 
ID2688298 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp3547120 
End bp3550323 
Gene Length3204 bp 
Protein Length1067 aa 
Translation table11 
GC content63% 
IMG OID637127926 
Productcytochrome c family protein 
Protein accessionNP_954274 
Protein GI39998323 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0173325 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGCGAACT TAATGGATCA AGACTCGGCA ACCCCGGGTG TGCAGGACAA GAAGTACACT 
GAAGCCGAAT TCAAGGCACA GGTCGACCTC TCCATGTGGG ACTCGGCCGT TTACTGCGGC
AGCTGCCACG TCGGCGGCGG TTTTGTGGAA AAGGACCGGA ATGGTGTCCG CTACTCCCAG
CGCCAGCCCG ATGCGGGCGG CCTCTTCGAC GCCTACCTCT TTTACGTGAA CGACACCTAT
GATCCCCTGA CCGGCATGCC GACCGAGAAG GTCGCCATGG CCCCCTGGAT CTATCCCCAG
TATGCGGATA ACAATCCCGC CAACGGGCCG GTCCTCGCCC CCAACGGTTG GGGCCGCGCC
GCCATGAGCC AGCCGGGCAT GCCCATCAGC GACGGGCAGC TCATGATGCC GAACGTCAAG
GAAATGGACT GCCTCTACTG TCACTTCGAG GGGTATGACA ACCTGATGCA CTCGGTCATG
AACTACTCCG GTGCACTCAA CGTGACGGCC ACCGTGGGCG CGGGACTCAT GGACACCAAT
AAGATGAGCC CCACCTACCA GGGCTACAAC GTCTCCCTGG TGGACGTGGA CCAGAACGGC
ATCGTCTCGC TCAACTCCAC GGCCCTCAGC CGCATTAAAG CCAATCCGCC GGCCGACAAC
TGCCGGAAGT GCCACATGCC CACCTCGCTC ACCGACCTGC CGAGCATGAT GAAGGACTTC
CTGTCGTCGG CGCCCATGAT CTACACGGGC AACTTCTCGG CCTCCATGAC CGGCCTCGAG
ATGCCCGCCT TCGACTTCAA CGCACCCTTC GGCTTCACAT GGAACTTCAG CCAGGGTCCC
TACGGCATCT CGCCGGTGGT CAACGTCACC AACATCTCCA GCTACATGAT CGCCGCCATC
GGCTACCCGG CGGCCATGGG CCAGACCATC TACAACAGCA TGCCCGGCTT CGGTGGCACG
GTTATCAACC CGATGATCGG CGAAATCGGC GGCGGCAACA AAGCCGGCAC CGGCCCGCTC
TACTACGAGA AGCTCGTGTT CGACGAAAGC GGCAACCCGG TTATGATGGG CATGGCTCCT
GCCATGGATC AGAACTCCCT CAAGAAGGGC ATCGTTCCCT TCCCGCGGGC CGAATGGTTC
AAGCGGGGTG ACCTGTGGGA CAACCGCGAC CAGGAAGTTC ACATCGGCAT GGAGTGCGCC
GGCTGCCACA TGGATACCGA CACCCTGAAG GTTGACGGCC CCGGCAAAGA CGGCAAGAGC
CTCTGCGATC CGGGCCGCGG CTACGACAGC GCCAGCGGCG TTGAGACCAG CACAGCCCTC
GGCATCGACA GCCGCAACAC CATCAAGAAG TGCGCTGACT GCCACGTGAC CGGCAAGAAC
AGCGACGGCG TCGCCATCGA TACCTTCAAC GCTCCCAACC CGACCTCGGC CCACGCCATG
TTCAACCTCA CCGCTCCCAT GGTCAACGCG GTCCGGATGA AGGCCGACGG CACTGGCGAA
GAAACCTTCC TCGGCAGCCA CATGGACATC ATTGACTGCG CCGTCTGCCA CACCTACAAA
AAACAGATGG TTGTCCGCGC CCTCGACTCC ACCTCCGGCA ACCGCTACCC GAACATGCTC
GGCTTCGACA CCTCCAAAGG GATGCTCGGC ATGTTCAACG AGCCGATGCC CGGCATGACC
AATGAAGGCG TCGAGTGGAA ACCGCTCTAC ACCTGGGCCA AGATCGGCGA CGGCGACAAG
ATCCTGCCCG ACGGCAGCGC CAACCCCAAC TGGCGCCGCA AGGTCTACCC GATCAACATC
ATCACTGCAG TCCTCTGGAA CAACATCGAT CCTTCCGTTG ATGCCAACGG CGACGGCGTC
ACCGGCCGTC CCGCAGCCCC GAACCTGACC TACTACGACC CGTGGATCAG CCGCGACATG
AAGGCCGGCG TCAATTACGG CCCCAGCGGT TTCGCTCCCG TGCCGGTAGG CTTCGGCAAC
GGCGCCTTCC AGAGCGCCTA CAACCCCGAC GGCACCTTCA CCGGCGCCTG GAACTACGTG
GGTGTGTACG GCGGCAACGC GGTCTTCTCG ACGCCCGAGG AGATTGAAAA CTACAAGGCC
TTCCGCAACA CCATCGCCCC CGCCGTTGAC GGCAAATCGT GGAGCGGGAC CCAGCTGCTG
CTTCTGGGCG GCCCCTACAT GATCACCCAC AACGTCCGGA GCACCGCAAA CTTCGTCCTT
GGTAAGAGCT GCGGCGACTG CCACGCCGCT GGTAAAGGGT TCTTCGACGG CGGCTTCAAC
ATGACCGGTA CCGCCATCAA GGCCAATGCC GGCGCCAGCT TCATGCAGTC CCCCGCAGAA
ATCCTCCAGA TCGTTGCCAA GGCAGAGGAT CTGGAAACCG GCGCCGAACT GGCCACCAAG
TCGGGCGCTG CCCGCGAGGT CAAGTTCGAG GAGCACGGCG ACTGGAACCC GGCCACCAAG
ACCTTCACTC CCAACCCGGC CGGTGAGTAC AAGAAGGCCA TCGATCTGAG CCGTAGCGAG
GCCCTCTATC CCGATGAAGG AACCTTCACC GCCGCCGACG GCACTGTCTA TCCGAACCGC
GCCGCTTACG TCGCCTACCT GACCGGCATC GGGGCAGCTC CCACGGCAGA CATCGCCACC
GTCGGATCCA ACAACACCGC CGGCCTCCCG ACCACGGCCG AAGTAACCGT CACCCAAGGC
ACGGTAACCC TGGCAGCTAC TGCAGCCGGC CCGCTCGCAG CGGACAAGTA CTCCTACGCC
TGGACCTGCA GTGATTCCAA TGTTACACTT TCCGGGCAGT CCGTAACGCG CGCCTTCAGC
GCCACCGGCA CCTACACCCT GACGCTGCGG GTGAAGAACC TCAGCACCAA CGAAGAGAAA
GTCGACCAGA TCAAGGTCAA GGTCACCGCT CCGGCTCCGG CCGCGGGCGT CGCCGTGGCC
GCCGCCGGCA TCTCCTACAA CAGCCCCTCC GCCGGCTACG CCATCATCCC GCTCACCATC
ACCGGCGTGA CCTTCAACAA GGTCAAGGTC GTCTGGGGTG ACGGCAACAC CAACATCTAC
ACAACCTCCG ACGCAAACTT CGTCCTGCCC TCCCACAAGT TCTGGGGCTA CCCGGCGAAG
AAATCCTTCA CCGCCAAGGT CTATGTCTAC AACGGCACCA CCCTGGCGGC GCAGAACGAT
AACATTTCCG TGCTGTTTCC CTAA
 
Protein sequence
MANLMDQDSA TPGVQDKKYT EAEFKAQVDL SMWDSAVYCG SCHVGGGFVE KDRNGVRYSQ 
RQPDAGGLFD AYLFYVNDTY DPLTGMPTEK VAMAPWIYPQ YADNNPANGP VLAPNGWGRA
AMSQPGMPIS DGQLMMPNVK EMDCLYCHFE GYDNLMHSVM NYSGALNVTA TVGAGLMDTN
KMSPTYQGYN VSLVDVDQNG IVSLNSTALS RIKANPPADN CRKCHMPTSL TDLPSMMKDF
LSSAPMIYTG NFSASMTGLE MPAFDFNAPF GFTWNFSQGP YGISPVVNVT NISSYMIAAI
GYPAAMGQTI YNSMPGFGGT VINPMIGEIG GGNKAGTGPL YYEKLVFDES GNPVMMGMAP
AMDQNSLKKG IVPFPRAEWF KRGDLWDNRD QEVHIGMECA GCHMDTDTLK VDGPGKDGKS
LCDPGRGYDS ASGVETSTAL GIDSRNTIKK CADCHVTGKN SDGVAIDTFN APNPTSAHAM
FNLTAPMVNA VRMKADGTGE ETFLGSHMDI IDCAVCHTYK KQMVVRALDS TSGNRYPNML
GFDTSKGMLG MFNEPMPGMT NEGVEWKPLY TWAKIGDGDK ILPDGSANPN WRRKVYPINI
ITAVLWNNID PSVDANGDGV TGRPAAPNLT YYDPWISRDM KAGVNYGPSG FAPVPVGFGN
GAFQSAYNPD GTFTGAWNYV GVYGGNAVFS TPEEIENYKA FRNTIAPAVD GKSWSGTQLL
LLGGPYMITH NVRSTANFVL GKSCGDCHAA GKGFFDGGFN MTGTAIKANA GASFMQSPAE
ILQIVAKAED LETGAELATK SGAAREVKFE EHGDWNPATK TFTPNPAGEY KKAIDLSRSE
ALYPDEGTFT AADGTVYPNR AAYVAYLTGI GAAPTADIAT VGSNNTAGLP TTAEVTVTQG
TVTLAATAAG PLAADKYSYA WTCSDSNVTL SGQSVTRAFS ATGTYTLTLR VKNLSTNEEK
VDQIKVKVTA PAPAAGVAVA AAGISYNSPS AGYAIIPLTI TGVTFNKVKV VWGDGNTNIY
TTSDANFVLP SHKFWGYPAK KSFTAKVYVY NGTTLAAQND NISVLFP