Gene GSU0970 details

Gene Information

Locus tag: GSU0970
Symbol:
ID: 2687553
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sulfurreducens PCA
Kingdom: Bacteria
Replicon accession: NC_002939
Strand:
Start bp: 1044984
End bp: 1047935
Gene Length: 2952 bp
Protein Length: 983 aa
Translation table: 11
GC content: 62%
IMG OID: 637125640
Product: hypothetical protein
Protein accession: NP_952024
Protein GI: 39996073
COG category:
COG ID:
TIGRFAM ID:
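
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the protein length excludes the stop codon; neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1044984
end_bp = 1047935

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted
# in the reported protein length.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")      # compare with the Gene Length field
print(protein_length, "aa")   # compare with the Protein Length field
```

Under these assumptions the computed values agree with the reported Gene Length (2952 bp) and Protein Length (983 aa).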


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.351823
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACAGCA AACGATATCG CTCAGACACG CGACCGGCCG CATGCGCCAT CCGCTTGCTT 
GCCGTGCTGG TTCTGCTGGC CGGTCTGGCG GCCCGGGCCG AAGCCGGGCC GGCCCTTCCG
GTCATGGTCA AGGATATCAA CACCGCATCG GTACCCGTCT CGTCCAGCCC CTCCGGCATG
ACGGCCAATG GCGGCATCCT CTTCTTCAGT GCCGGCGACG GCGCCAACGG CCCGGAACTC
TGGAAGAGCG ACGGTTCCGC GGAAGGGACG GTGCTCGTCA AAGATATCAA TGCCGGGCCC
GGGCCGGGCA CGCCTCAGAA TTTCTCCGTC ATGAACGGGA TCACCTATTT TTCCGCCATG
GACAGCTTCC GCGGCGTGGA ACTCTGGAAA AGCGACGGCA CGGCTGCTGG GACGGTCATT
GTTAAGGATA TCTACCAGGG GGGCGAATCA TCCAACCCCC TGGAGTTGAC AGTGGCAGGG
AACACGCTTT TCTTTTCCGC CGATCATCCC GTCTATGGCA AGGAACTCTG GAAGAGCGAC
GGCACCGCCG AGGGAACCGT TCTCGTGGCC GATATCGCTG CGGAGGCGAG TTCGACGCCC
CAGTGGCTGA GGGCGGTCAA CGGTACGCTC TTCTTCGCAG CCGACGACGG TCTCCATGGC
CGGGAGCTCT GGAAGAGCGA CGGGACGCCG GAAGGCACGG TGATGGTCAA GGATATCAAC
CCCTTCGGCG GGTCCGATCC GGGCGAGATG GCGGTGTCCG GCGGCATCCT CTACTTTACC
GCCGATGACG GTGAAAATGG CCATGAGCTC TGGAGGAGCG ACGGCACCGC CGAAGGGACG
TATCTGGTCG CCGATATAGC CCCCGGCGAA GAGAGTTCCT ACCCCTTCGA GCCGGTGGGC
ATCAATGGCC TGCTCTATTT TACGGCCAAT GACGGCTACA CCGGGTACGA ACTCTGGCAG
AGCGACGGCA CCCCCGAGGG GACGACGCTG GTGAAGGATA TCAATCCGGA CGGCGAAGAC
TCAATGCCCT GGGGCATAGT GGGCATGGAC AGGTACGTCT ACTTCGCGGC CGATGACGGC
GTCAACGGGT ATGAACTCTG GCGCACCGAC GGCACCATGG GGGGGACGGA GATGGTTGCC
GACATCCAGC CCGGGATGGG CGGTTCCATG TACAGCTCTC CCCGGCTGGT GAACGGCATG
CTCCTCTTTG CCGCCGACGA CGGAGAGCAC GGCATCGAGA TATGGAAAAG CGACGGCACT
GCCGAGGGCA CCCTCATGGT CAGGGATATC ATCCCGGACG CCATGTCGTG GCCATCCGAG
CTCATGGTGC ATAACGGTAC ACTCTACTTC GCAGCCGACG ACGGCGTAAA CGGCACGGAG
CTCTGGAAGA GCGACGGAAC GGCCGAGGGC ACGGTGCTGG TGCGGAACAT TGCACCCGAG
ACGGCCAGCA GTCTTCCCTA CCAACTGGCC GTGATGGGGA CCACGGTATT CTTTGCCGCG
GCCGATACCG ATCTTGACTT TGATGTCTGG AAGAGCGACG GTACTGCTGA TGGTACGGTG
CTCGTCAAGG AGATCAATCC CGAAGGGTGG GCCTATCTGG ACCGGTTGAT GGTCGTTGGT
GATACCCTTT ACTTCCTGGC CGAGGACAAC TATGGAGAGG CCAGCCATGG CATCGAACTC
TGGAAGAGCG ACGGCACCGC CGAAGGCACC AGGATGATCA AGGACATCAA CCCCGGGCCC
CAGGGGATAT TCTTTCCCGG CAACCCCAAT TATCCCTTCT CCATGGCCGC CGTCGGCACT
ACGGTGTATT TCCCCGGTTT TACCGCGGGC AATGGTCATG AACTCTGGAA GAGCGATGGT
ACCGCCGAAG GGACGGTCCT CGTGAAGGAT ATCAATCCCG TTTTCGATTT CTCCTCATTT
CCCGACAGTT TTACCGCCAT GAACGGAGCG GTCTATTTCG TGGCCGATGA CGGTACGCAT
GGCGCCGAGC TCTGGAAGAG CGACGGTACC GCCGACGGCA CCCGGATGGT CAGGGATATC
TATCCGGACG GCATCGGCTC CAGTCCCTTG TCGCTCACGG TCATGAACAA CGTCCTCTAC
TTTAGTGCCG CCGGTGACGA AGGTGGCTAC GGACTCTGGA AGAGCGACGG TACCGCCGAG
GGGACCACGT TCGTAAAGGA CACCTCTCCC TTCAATCACT CGCTTCTCCC TGCCTACCTG
ACCCCCGTAA ATGGAACTCT CTTCTTCGCC GCCCACGACG AAAACGCCGG ATTCGAGCTC
TGGAAGAGCG ACGGGACCAC CGACGGCACC GTGCTGGTGG CCGATATCCT GCCGGGGGAA
GGGGCGTCCA ATCTGCGCTT GCTTACCGGC GTGAACGGTA CCCTGTTTTT CGTGGCCGAC
GATGGAGTGC ACGGCGAGGA GCTCTGGAAA AGCGACGGGA CGCCCGAGGG AACGGTGATG
GTGAAGGACA TTTTTCCGGG GGATGGGATA TCTGGCATCA CCTGGATCAA GGTGATGAAC
GGGATGCTCT ATTTTGCGGC CGACGACGGA GTGAACGGCC TCGAACTCTG GCAGAGCGAC
GGAACCGCCG AGGGGACGGT GCTGGTCACG AACATCGTGG CGGGCCAGGG GAGTTCGTCT
CCCTCGTATC CGGTGGTGGC GGGCAATACA CTCTACTTTG CCGCTACTGA CGGAGGCAGC
GGCGTCGAGC TCTGGAAGTT TTCGCCCGAT CCGCCCGATG GTGATCTGAC CGGCAACGAG
ACACTGGAGA TCCCCGATGT GCTGCGCGCT CTGCGGATTG CGGCCGGCAT CGCGGCTCCC
ACCGTGGCCG ACTTCATCCA CGGCGATGTG GCCCCCCTTG ACGGAAACGG CCGCCCCGCC
CCGGACGGCG TGATAGATAT GAACGACGTG CTGGTTGTCT TACGCAAGAT GCTGGGCGTC
GTGTCGTGGT GA
 
Protein sequence
MNSKRYRSDT RPAACAIRLL AVLVLLAGLA ARAEAGPALP VMVKDINTAS VPVSSSPSGM 
TANGGILFFS AGDGANGPEL WKSDGSAEGT VLVKDINAGP GPGTPQNFSV MNGITYFSAM
DSFRGVELWK SDGTAAGTVI VKDIYQGGES SNPLELTVAG NTLFFSADHP VYGKELWKSD
GTAEGTVLVA DIAAEASSTP QWLRAVNGTL FFAADDGLHG RELWKSDGTP EGTVMVKDIN
PFGGSDPGEM AVSGGILYFT ADDGENGHEL WRSDGTAEGT YLVADIAPGE ESSYPFEPVG
INGLLYFTAN DGYTGYELWQ SDGTPEGTTL VKDINPDGED SMPWGIVGMD RYVYFAADDG
VNGYELWRTD GTMGGTEMVA DIQPGMGGSM YSSPRLVNGM LLFAADDGEH GIEIWKSDGT
AEGTLMVRDI IPDAMSWPSE LMVHNGTLYF AADDGVNGTE LWKSDGTAEG TVLVRNIAPE
TASSLPYQLA VMGTTVFFAA ADTDLDFDVW KSDGTADGTV LVKEINPEGW AYLDRLMVVG
DTLYFLAEDN YGEASHGIEL WKSDGTAEGT RMIKDINPGP QGIFFPGNPN YPFSMAAVGT
TVYFPGFTAG NGHELWKSDG TAEGTVLVKD INPVFDFSSF PDSFTAMNGA VYFVADDGTH
GAELWKSDGT ADGTRMVRDI YPDGIGSSPL SLTVMNNVLY FSAAGDEGGY GLWKSDGTAE
GTTFVKDTSP FNHSLLPAYL TPVNGTLFFA AHDENAGFEL WKSDGTTDGT VLVADILPGE
GASNLRLLTG VNGTLFFVAD DGVHGEELWK SDGTPEGTVM VKDIFPGDGI SGITWIKVMN
GMLYFAADDG VNGLELWQSD GTAEGTVLVT NIVAGQGSSS PSYPVVAGNT LYFAATDGGS
GVELWKFSPD PPDGDLTGNE TLEIPDVLRA LRIAAGIAAP TVADFIHGDV APLDGNGRPA
PDGVIDMNDV LVVLRKMLGV VSW
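
The gene and protein sequences above are printed in groups of ten characters, so the whitespace has to be stripped before any analysis. The sketch below shows one possible way to do that, then translates the first line of the gene sequence and computes a GC fraction. The helper names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical and not part of any database tooling. The codon table is the standard amino-acid assignment, which, to the best of my knowledge, translation table 11 shares; table 11 differs in its permitted start codons, which this sketch does not model.

```python
# Standard codon-to-amino-acid assignments, built from the usual TCAG ordering.
# Assumption: translation table 11 uses these same assignments for sense codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text):
    """Remove spaces and line breaks from a grouped sequence listing."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Return the fraction of G and C bases in a non-empty DNA string."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate complete codons from position 0, stopping at the first stop codon."""
    protein = []
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    for pos in range(0, usable, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence, copied from this record.
first_line = "ATGAACAGCA AACGATATCG CTCAGACACG CGACCGGCCG CATGCGCCAT CCGCTTGCTT"
dna = clean_sequence(first_line)
peptide = translate(dna)

print(len(dna), "bases")
print(peptide)
print(round(gc_fraction(dna), 2))
```

The translated peptide for this excerpt corresponds to the first two ten-residue groups of the protein sequence (MNSKRYRSDT RPAACAIRLL). Applying the same functions to the full gene sequence would allow a comparison with the reported GC content and protein length.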