Gene GSU3417 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU3417 
Symbol 
ID2686608 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp3759521 
End bp3761050 
Gene Length1530 bp 
Protein Length509 aa 
Translation table11 
GC content67% 
IMG OID637128112 
Productdioxygenase, putative 
Protein accessionNP_954457 
Protein GI39998506 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3670] Lignostilbene-alpha,beta-dioxygenase and related enzymes 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACAGAA GGGACTTCCT TAAAACCGCC GGACTGACCG GCATCGCCCT GGGGCTCCCC 
GGCTGCGCCC GGAGCCTCCC CTTCGGCCGG GACGTCTTTC CCGACTTCGG TGACGATGCC
CGCCCCTACC TGGGGCTTGC CACGTCGCTG CGGGAGGAGC ACGACTACGA GGCCCGGGTG
GAAGGAACCA TCCCCGCGGG GCTGCGCGGC ACCCTCTACC GCAACGGCCC GGCCCTCTTC
GATCGGGGTG GGATGCGCAA GCGGACCCTC CTGGACGGGG ACGGCATGGT GCAGGCGTTC
CGTTTCGGCG ACCGTGGGAT CAGATATGCG AACCGCTTCG TGCGGACCCG GAAATTCGTG
GAAGAAGAGG CTGCCGGCCG CTTCCTCCAT CCCACCTGGA GCACCCAGGC CCCCGGCGGA
ATCTGGACCA ACGTCTGGCC CACGGAGCGG CTCCTGAGCC AGGCGGGGAT TACGGTCTTC
CCGTGGCGGG GACGGCTTTA CGCCTTCGAC GAATCATCTT TCCCCTACGA ACTGGACCCG
GATACCCTGG CAACCGTGGG CGAAACCACC TTTGGCCTGC CGCGCGACCT CACCACTTAC
TCGGCCCATG GCAAGTTCGA CCCGGTCACG GGAGAATGGC TCCACTTCGG CATCCGCTAC
GGCCCCCGAA CCTTTGTCCA CCTGACCACC TTCAATGCGG ACGGCACCCT CCGCCGCCAC
CGGGCCCTGG AGCTTCCCCG CGCGATCTAC ATGCACGACT GGTTCGTAAC CGAGCGCCAC
GTGGTGTTCC ACCTCCACCC GGTGGAAATC GCCTACTGGC CCTTCCTGCT CGGCATCCGG
AGCATGGCCG AATCCCTCCG CTGGCGCCCC GAACGGGGAA CGATCCTCAT GGTGGCGGAG
CGGGACGGCG AGGCGCCGCC ACGCCTGGTG GAAACCGAGG CATGCTATCT CTGGCACACC
TTCAACGCGT GGGAAGAACG AGGGAAAATC ACTGCCGACT TCGTGGGGTA CCGCAACCCG
GACCATTTCA TCAGCGACGA TCCGGTCATA ACCGCGGTCA TGCTGGGACG GCGGGGAACC
TATTCCTATC CCGGCGAGGT CATGCGCTAC CGGATCGATC CGGCCAGGGG AACGGCCGCC
CGCGAGGTGC TCCATCAGGG GAGCTGCGAG TGGCCCCGCA TCGACGAGCG GCTCCGTTGC
CGCCCACACC GCACGGGGTA CATGCTCCGC TGCCTTCCCG GCGAATTCTT TTGGTCAATC
GTCATGGGGC TCGACCCGGT TACCGGCCGC ACCGACGAGT ACAGCTTCGG CCGGGGCGTC
TACTGCACCG AGCCGGTCTT CGCGCCGCGG CCCGACACCC TCGCCGGCGG CCCCGGCTGG
CTCCTGGTCG AACTCTATGA CAGCCGCACC CGCACCAGTT CACTGGCCAT CCTCGACGCC
GACAGGATCG CCGATGGCCC CCTGGCCCTC GTTCGACTCA CCCACCACGT CCCCTTCAGT
TACCACGGCT GGTGGCAGCC TGCTTCTTAG
 
Protein sequence
MNRRDFLKTA GLTGIALGLP GCARSLPFGR DVFPDFGDDA RPYLGLATSL REEHDYEARV 
EGTIPAGLRG TLYRNGPALF DRGGMRKRTL LDGDGMVQAF RFGDRGIRYA NRFVRTRKFV
EEEAAGRFLH PTWSTQAPGG IWTNVWPTER LLSQAGITVF PWRGRLYAFD ESSFPYELDP
DTLATVGETT FGLPRDLTTY SAHGKFDPVT GEWLHFGIRY GPRTFVHLTT FNADGTLRRH
RALELPRAIY MHDWFVTERH VVFHLHPVEI AYWPFLLGIR SMAESLRWRP ERGTILMVAE
RDGEAPPRLV ETEACYLWHT FNAWEERGKI TADFVGYRNP DHFISDDPVI TAVMLGRRGT
YSYPGEVMRY RIDPARGTAA REVLHQGSCE WPRIDERLRC RPHRTGYMLR CLPGEFFWSI
VMGLDPVTGR TDEYSFGRGV YCTEPVFAPR PDTLAGGPGW LLVELYDSRT RTSSLAILDA
DRIADGPLAL VRLTHHVPFS YHGWWQPAS