Gene GSU3022 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU3022 
Symbol 
ID2686805 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp3316979 
End bp3320353 
Gene Length3375 bp 
Protein Length1124 aa 
Translation table11 
GC content53% 
IMG OID637127715 
Producthypothetical protein 
Protein accessionNP_954064 
Protein GI39998113 
COG category[S] Function unknown 
COG ID[COG4627] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATCTGTC ACATTGCTTT CGAAAGCGAC TACTCTTTCG CGCTACACAA CCTGCTCTTG 
AAACACTTCG ATATCAACGA GCATATTTTT CTGATATGCG GCAGAGCCTC ATGGGATTCG
ACGAAACAAT TTCAATACAG AAACAGCGTT AAGGTCGCAG ACCTGAACTC TCAGGAAGTC
CGGCGACTCC TCTGCGAGTC GGAACAGATT ATTATTCATG GCTTATTTTT CGGGGAAGTG
GTCGACTACT TTCACCGCAA CCAGGATCTT CTGCCCAAGA CCAGTTGGAA GCTCTGGGGC
GGCGACCTTT ACTGTTACAG GAATGCAGGA ACGACAGACT TTCCTGCTGA ATATGAGGAG
AAAAGGAGAC AGGTCATCCG GAACTTCGGT CGGATAATCT GTGTCTTAGA GGAGGATTAT
CAACTCGCCC GACAGGTATA CGACACCAAG GCCACCCGTG CACATGGGTT CTACCCGTTG
CTGCTCAATT ACGACTACCT TGATTCGCTA CGAAAACCTC GCGCTCCGTC GGAGATAATC
ACCGTTCAAG TCGGCAACTC TGCAACACGC GAAAACTGCC ACGAAGAAGC ATTCGAACAC
CTCATCCATC TCGACAACGA CAACATGAGA GTCTGCTGTC CCCTCTCCTA CGGTGACAAA
TCCTATGGCA ACGAGATTAT TCGGTTGGGG ATGAAACTGT TCGGAAGTCG GTTCTGCCCC
CTGACCGATG TTCTCCCATT CAAGGAGTAT TCCGACCACC TCAGCTCAGT TGATGTCGCC
GTTATGAACC ACAACCGTCA GCAAGGGCTC GGCAACACAG TAGCGTTGTT GTATCTCGGC
AGGAAAGTAT ATATGCGGCA TGACACGACG TCCTACAGTT GGCTGATGAA ACACGGAATC
ACCGTCTACG ATATGCGGAA TTTGGGGCAT GCTTCATTTG CCGAGTTTTC GCATATTGAA
GAGGATGTGG CGGCGCGAAA CACAGAGATC ATCGGCAGGG AATTCTCGGA CGATGCATGC
GTAAAGCTAT GGAAGTCAAT TTTCAGCTCA AAGGGAAGAC AATCTAAGAT GACGGTACCG
AATGAGAAGG GTTCACTAAT TCAAAAACTC AAGCACCAAT ATTTCCCCGG CTCCAACTTC
GTAGGCCCTT ACGAGCAAAA CGGCCACGCC CCATCCATCT ATGAAGAGCA GTGGCTGTGG
GCACATCGCC ATCTGATCAA AGGAACCGTC CTTGACATGT CCACACCCCG TTATTGGCAC
GAATTCATTC ATACCATGGA ATCGGTTTCT CGTGTCGTCA TCAGCGAGAT GGGATGCGAC
ACCGTAACCA AATACGGCAA GAGCTCATCG GTAGACATAG TAGCCGATTT CTGCGATCCG
AATCTGCCAG TGGCTCCGGA AACGTTCGAC ACCATCCTCT GCCTGAGCAT CCTGGAACAC
TGCACCAATC CCTTGTCCAT GGTAAATAAT CTTTACAAGT TACTCCGCCC CGGAGGCACC
GTTTTCTTCT GGGCACCCTT CGCCTACATC GATGGGCACC TTGAGCCTGA CTATTGGCGT
TTCGGGCGTG ACGGCTTCCG TCTTCTGGTC AAACAGGCTG GGTTTGAAAT CATGGAAGAA
GGAAACTTCG GGGACCTCGG AAAATATTTC CTCCAAGAAT TTGGATTTGA TGCATCAGCA
CGGAACGGTC ATCGTGGTGT GCCCTGCGCC AACTGGATTA TCTGCAAAAA ACCGCAGGAT
ACCAGGACCC TCCCCAAGCC TACACATGGG ATCTCGACGA GCGCGGGGAA CTCACCCCTC
CGCCTTTACG CCGGCGACAT CCCCAACCGC CCCGAGTACG AAGGCTGGCT CGGCCTCTCG
CTAACCCAGG AAAACGACCG CCATATCCGG CACGACATCA CCCGACCCCT TCCCTTCGCC
GACAACTCGG TGGACGCTTT CCAGGCTGAA GATGTGCTGG AGCATATCGC CTATGACCGG
ATCGTCCCCG TTCTTAACGA AATCCATCGT GTACTCAAAC CGGGCGCAGT GTTCCGGCTC
TCCGTCCCCG ACTATCGTTG CGATATCCTT GACGCCAGAA CGGAAAAGGA CGCAACGGGA
GCGCCCGTCT TCGATCCCGG CGGAGGAGGT ACCAGAGAAA ATCCGGGGCA CGCCTGGTTC
CCCCGCTACG AGACGGTTAA GACCCTGGTC GAAAAATCTA ATTTTGGAAC AGCGGGAGAA
ATTTCATTCC TTCACTACTA CGACGAGCAT AACCTTCCGG CGATGCGGGA GATCGACTAT
GCGAAGGGAT TTATTCAGAG GCCCCCGGAC CACGACCCGC GGGTGCAAAC ACCCTACCGT
CCGATGTCTC TAGTCGTAGA CCTGACAAAA AGAGACGTAA GAATATTTTC TTCATTCAAC
AACTCCGCAT CTCACGCGAA TACGAATGTT GGCGGCACCG ATGCTCACTC CATCGCCGAA
ACCGCAAGCC GAATATTGCA GGCGGGTGAA AAGCTCTATT CTGAAGGATA CACCGCAGAT
GCAGAGGGAT GCTTCAACGC TCTCCTCGTC ATGCAAGAGC CGCTTGCCAC CGCCCACAAC
AATCTTGGCG TAATCCACCA GGGCCGCCAT GAACTCGATA ACGCTCTTGA ACACTTCAGA
TCAGCACTGG CCCTCATGCC GTCACATCGG GAGGCACGTG AAAACATCAA AGCCACCCTT
TCATCTGAAA GCATCTATCC CTATGAGGGG AATCGGTCCA GATCCAATCA TTACAATGAA
GACTACTTCA ATTGGCAAAA GAACATCGGT GCGTTCGGCG GAAGAGCCAA CCTCTTTAAG
TTCAGCGAAT TCATAACACC TTCCGACACT ATACTCGACT TCGGCTGCGG AGAAGGATAT
CTCCTCTCAA ACATCACCTG TGCAAGGAAG ATCGGAGTAG AACTGAACGA AGCAGCCCGA
AGAACCGCAT GGGCACAGGG GGTAGAAGCA TTCGCGTCTC CGGAGGAGAT CCCCTTTGGA
ACTGCAGACC TGGTGATCTC CAACCACGCG CTGGAACATG TGCTGTCACC GCTCGAAACG
CTCCAAGCCC TGATCAAGAC GCTCAAGCCG GGGGGGATGA CCGTTTTTGT CGTCCCCCAC
CAGGATACAC GGGAAGAATA CAATCCCGAC GACATCAACA TGCACCTCTA CACCTGGAAC
CAGTTGACGC TGGGCAATCT GTTCCGGCAG GCAGGATTCT CTGTTGAACG CGTCGAGGCG
ATCCAGCACC AATGGCCGCC CAATTTCGCA GAGGTCTATG CCCAGGTCGG CGAAGCGGAA
TTTCACCGGA TATGCCGGCA AACGGCTATT CAGAACAATA ATTACCAGAT TCGGATAGTT
GCCCGCAGAA AGTGA
 
Protein sequence
MICHIAFESD YSFALHNLLL KHFDINEHIF LICGRASWDS TKQFQYRNSV KVADLNSQEV 
RRLLCESEQI IIHGLFFGEV VDYFHRNQDL LPKTSWKLWG GDLYCYRNAG TTDFPAEYEE
KRRQVIRNFG RIICVLEEDY QLARQVYDTK ATRAHGFYPL LLNYDYLDSL RKPRAPSEII
TVQVGNSATR ENCHEEAFEH LIHLDNDNMR VCCPLSYGDK SYGNEIIRLG MKLFGSRFCP
LTDVLPFKEY SDHLSSVDVA VMNHNRQQGL GNTVALLYLG RKVYMRHDTT SYSWLMKHGI
TVYDMRNLGH ASFAEFSHIE EDVAARNTEI IGREFSDDAC VKLWKSIFSS KGRQSKMTVP
NEKGSLIQKL KHQYFPGSNF VGPYEQNGHA PSIYEEQWLW AHRHLIKGTV LDMSTPRYWH
EFIHTMESVS RVVISEMGCD TVTKYGKSSS VDIVADFCDP NLPVAPETFD TILCLSILEH
CTNPLSMVNN LYKLLRPGGT VFFWAPFAYI DGHLEPDYWR FGRDGFRLLV KQAGFEIMEE
GNFGDLGKYF LQEFGFDASA RNGHRGVPCA NWIICKKPQD TRTLPKPTHG ISTSAGNSPL
RLYAGDIPNR PEYEGWLGLS LTQENDRHIR HDITRPLPFA DNSVDAFQAE DVLEHIAYDR
IVPVLNEIHR VLKPGAVFRL SVPDYRCDIL DARTEKDATG APVFDPGGGG TRENPGHAWF
PRYETVKTLV EKSNFGTAGE ISFLHYYDEH NLPAMREIDY AKGFIQRPPD HDPRVQTPYR
PMSLVVDLTK RDVRIFSSFN NSASHANTNV GGTDAHSIAE TASRILQAGE KLYSEGYTAD
AEGCFNALLV MQEPLATAHN NLGVIHQGRH ELDNALEHFR SALALMPSHR EARENIKATL
SSESIYPYEG NRSRSNHYNE DYFNWQKNIG AFGGRANLFK FSEFITPSDT ILDFGCGEGY
LLSNITCARK IGVELNEAAR RTAWAQGVEA FASPEEIPFG TADLVISNHA LEHVLSPLET
LQALIKTLKP GGMTVFVVPH QDTREEYNPD DINMHLYTWN QLTLGNLFRQ AGFSVERVEA
IQHQWPPNFA EVYAQVGEAE FHRICRQTAI QNNNYQIRIV ARRK