Gene GSU0286 details

Gene Information

Locus tag: GSU0286
Symbol:
ID: 2686797
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sulfurreducens PCA
Kingdom: Bacteria
Replicon accession: NC_002939
Strand:
Start bp: 315412
End bp: 317049
Gene Length: 1638 bp
Protein Length: 545 aa
Translation table: 11
GC content: 64%
IMG OID: 637124952
Product: HEAT repeat-containing PBS lyase
Protein accession: NP_951346
Protein GI: 39995395
COG category: [C] Energy production and conversion
COG ID: [COG1413] FOG: HEAT repeat
TIGRFAM ID:
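
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS includes its stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length // 3 - 1


length = gene_length_bp(315412, 317049)
print(length, protein_length_aa(length))  # prints: 1638 545
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of this record.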


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0933039
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGAGCAA ACGGAGGGAA CGCAACGGGC AGGCCTGCGG CAGGAGGCGG GCAGTACGGG 
ATCGTGCTGG CTGAACTCTA TCGGGCGCTG AAGGCCCTGA CGTTTTATCC TGAGGGGCAT
CCTCAGCGCG CCGAAGTCCT GGCACGGGCC CATGCCGCGC TCCGGGGAAT CCTCTGGGGA
ACAGAGCTCG TGTTCGTCAT CGGTAAGAAC GGTTTCACCG CCTCGGAGGG CGGGGCTTTT
GTGGAGCCGA CAGCAATGAC CCAGGCCCTT GCGCGGGAGC TGTTCATCCG ACGGGTAAAG
CGGCTCACGA TCCTGCCCGA CGTCGGCGAG GCCGACCTGT CCTTGTTTCT CACCCTTTTA
TCCATGGACC ATCGCGACGT CCACGAGGCG GGAGGGATAG AGGCGCTTAT GGCCCAACTC
GGCCTGACGA CCATCTGGGT CAACGAAGTG GACCTTGACG AAATCCGGCG CAAGCGGGCG
GTCGTCGAGC AAACCCGTTC TGCTGCGGCC CATGACGGCA CCTCTGACAA TGTATTGTCT
GCCATCGAAA AGGTGGCGGA CCAGGACGGG ACACCCGAAG AGGGCAGGGA CGCGGCAGGA
CTGCAGGAAG AGGATCGGTT GGAGGCGGAA GCCATACTCG GCCGTATGGA GCGGGAAACC
AGCGACGATC GGTACCGGGA GCTCGCCCGC CTGTTGTCGG CCCGTTGCGC GGAACTCGGC
GACCGGGGTG AGTTCGAGCG GATTCTGTGG GTGTTGGTGA ACCTGCACCG GCATGCGAAG
AGTGAAGCGG CGAGCGCCGC GCGGCGGGGG TATGCCCTTC TTGCCTTCGA GAAGGCGGCC
GGGGGGCCCA TGCTCCCGTT TCTCGTAGGG CGCCTGGAGG AGCGCGGGGA GGAGCGGGAG
ACCCTCGTCG ACCTGTTCCG GGAGATTGGC GAACCAGCGG TTGCGCTGCT GGTTGAACGC
CTCACGCTGG CAGAGAGCAG GAATGCCCGG AAGCTGATCA TCGATGCACT CGTGGCAATC
GGTGCGGAGT CGGTGGCTCT TCTGACCGAG CATCTTGTCG ACCGTCGCTG GTACGTGGTG
CGGAACGCGG CGGTCATCCT CGGCGAGATC GGCGATCCTG CAAGCGCCGA ACCCCTGCGG
GCCTGTTTAG TTCATACGGA CGTAAGGGTA CGCCGGGAGG CGCTCAGGAG CCTCGTCAAG
ATCGGCGGTG AACAGGCAGA GGACGCGATT ATCGGCCTGC TCGCAGCGGA GGATAATCTG
ACGAAGCGTC ATGCCGTGAT CTCGCTTGGT CTCCTGAAGA GCCGCCGCGC GGTGGAGCCT
CTTTGTGCTC TCATCGAACG CCGCGATCCC TTTAAAAAAT CGCTTGCCTT CAAGAGGGAC
GTCATCCAGT CCCTCGGCCG GATCGGAGAC CGCCGGGCGG TGCCGTCTCT GCTCCGGCTG
CTGGAGCATC GGCCCTGGTT CGGCCGACGC CGTTGGGACG ACGTCCGGAT GGCGGTGGTA
ACCACCTTGG CCCAGATCGG TGACCCGGCC GCTCTGGGAC TGCTGGATTC GCTGGCAGCG
AAAGGCGGCA TGCTTGCGGC GGCCAGTGCC GATGCAGCAG AAGACATCCG CCGTCGAGGG
GGGGTCGTTA ATGATTAA
 
Protein sequence
MGANGGNATG RPAAGGGQYG IVLAELYRAL KALTFYPEGH PQRAEVLARA HAALRGILWG 
TELVFVIGKN GFTASEGGAF VEPTAMTQAL ARELFIRRVK RLTILPDVGE ADLSLFLTLL
SMDHRDVHEA GGIEALMAQL GLTTIWVNEV DLDEIRRKRA VVEQTRSAAA HDGTSDNVLS
AIEKVADQDG TPEEGRDAAG LQEEDRLEAE AILGRMERET SDDRYRELAR LLSARCAELG
DRGEFERILW VLVNLHRHAK SEAASAARRG YALLAFEKAA GGPMLPFLVG RLEERGEERE
TLVDLFREIG EPAVALLVER LTLAESRNAR KLIIDALVAI GAESVALLTE HLVDRRWYVV
RNAAVILGEI GDPASAEPLR ACLVHTDVRV RREALRSLVK IGGEQAEDAI IGLLAAEDNL
TKRHAVISLG LLKSRRAVEP LCALIERRDP FKKSLAFKRD VIQSLGRIGD RRAVPSLLRL
LEHRPWFGRR RWDDVRMAVV TTLAQIGDPA ALGLLDSLAA KGGMLAAASA DAAEDIRRRG
GVVND
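
The sequences above are printed in blocks of ten characters, so they need the whitespace removed before any analysis. The following is a minimal sketch of how the gene sequence could be cleaned, its GC fraction computed, and its translation checked against the protein sequence. The helper names are hypothetical. The codon table is the standard amino acid assignment, which translation table 11 shares; table 11 also allows alternative start codons, which this sketch does not handle (the gene here starts with ATG, so that simplification does not matter for this record).

```python
from itertools import product

# Standard codon assignments in TCAG order (shared by translation table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence as listed in this record.
first_row = clean_sequence(
    "ATGGGAGCAA ACGGAGGGAA CGCAACGGGC AGGCCTGCGG CAGGAGGCGG GCAGTACGGG"
)
print(translate(first_row))
print(round(gc_fraction(first_row), 2))
```

Pasting the full gene sequence into `clean_sequence` and comparing `translate(...)` with the cleaned protein sequence gives a quick consistency check between the two listings.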