Gene GSU1684 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU1684 
Symbol 
ID2687257 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp1844111 
End bp1845568 
Gene Length1458 bp 
Protein Length485 aa 
Translation table11 
GC content62% 
IMG OID637126365 
Producthypothetical protein 
Protein accessionNP_952735 
Protein GI39996784 
COG category[S] Function unknown 
COG ID[COG1690] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGGCAC CGGCAGCGCT GAAGAGAATA TCTGCAACGG TTTGGGAGCT TCCCGTAAGC 
TACAAAAAGG GAATGCTCGT TCCCGCCCGC ATCATTGCCA CGGAAAAACT CATCAACGCC
ATGGATGCTG GCGTTTTCGA GCAGGTATCC AACGTGGCCT GTCTGCCGGG CATCCAGAAA
TACGCCTTCT GCATGCCTGA CGGCCACTGG GGGTACGGCT TTCCCATCGG CGGTCTGGCG
GCCATGGACC CGGAGACGGG GGTCATTTCT CCCGGCGGCA TCGGCTTCGA TATCAACTGT
GGCATGCGGC TGGTCCTGAC CACTCTCACC TACGAAGAGG TGAAGCCGCG CCTGCGCGAA
CTGGTGGATG CCCTCTTCTA CCGGGTGCCG GCCGGGGTCG GGAGCCATGG TTTTGTGCGG
CTGTCCCACG ATGAGTTCTG CCGCGTGGCC GAGCAGGGCT CATCGTGGTG CCTGAAACAT
GGCTATGCCT GGCCCGAAGA CCTGGAGATG ACCGAAGAAC ATGGCTGCTT CACTGGAGCC
GATGCCACGA AGGTCAGCCA GAGAGCCGTT GACCGGGGAT ACAACCAGAT CGGGACCCTC
GGCTCGGGCA ACCACTACCT GGAAGTCCAG GTGGCCCGCC CCGAGAACAT TTTCGACGAG
GATACCGCCC GGGCATTCGG CATTACCGTG CCGAACCAGG TGGTGATCAT GTTCCACTGC
GGCAGCCGCG GCTTCGGCCA CCAGGTGGCC ACCGACTATC TCCAGCTCTT CCTGTCGGTC
ATGGAGAAAA AGTACGGCAT CCGCACCAAT GACCGCGAAC TCGCCTGTGC CCCCTTCCGC
TCCCGCGAGG GGCAGGACTA CTTTGCCGCC ATGAAGTGCG CGGTGAACAT GGCCTTTGCC
AATCGCCAAG TCATCCTCCA TCGCATCCGC GAGGTCTTCT CGGACGTATT CGGTCGCGAC
CCGGGCGATC TGGGCATGGA TATGGTCTAC GACGTGGCCC ACAACACCGC CAAGCTGGAG
ACCCACCTGG TGGACGGGAA AAAACGCGAA CTGCTGGTTC ATCGCAAGGG GGCCACCCGG
GCCTTTGGCC CCGGCATGGA GGGGATTCCC GCCCGCTACC GGGAAACGGG CCAGCCGGTC
ATTATCGGCG GCAGCATGGA AACCGGCTCC TATCTGCTGG CAGGCGACCC GGGAGGCGGC
GACACCTTTT TCACCACGGC CCACGGCAGC GGCCGGACCA TGAGCCGCCA TCAGGCGAAA
AAGTTGATCA AGGGACAGAA GCTGCAGCGG GATATGGAGG AGCGCGGCAT TTACGTGCGG
ACCGCGTCCT GGGGAGGACT GGCGGAAGAG GCGGGGCAGG CCTACAAAAA CATTGATGAC
GTGGCCGAGG CCACTGAGTT GTCGCATCTG AGCCGGCGGG TCGCACGGCT CGTGCCCATC
GGAAATGTCA AAGGTTGA
 
Protein sequence
MAAPAALKRI SATVWELPVS YKKGMLVPAR IIATEKLINA MDAGVFEQVS NVACLPGIQK 
YAFCMPDGHW GYGFPIGGLA AMDPETGVIS PGGIGFDINC GMRLVLTTLT YEEVKPRLRE
LVDALFYRVP AGVGSHGFVR LSHDEFCRVA EQGSSWCLKH GYAWPEDLEM TEEHGCFTGA
DATKVSQRAV DRGYNQIGTL GSGNHYLEVQ VARPENIFDE DTARAFGITV PNQVVIMFHC
GSRGFGHQVA TDYLQLFLSV MEKKYGIRTN DRELACAPFR SREGQDYFAA MKCAVNMAFA
NRQVILHRIR EVFSDVFGRD PGDLGMDMVY DVAHNTAKLE THLVDGKKRE LLVHRKGATR
AFGPGMEGIP ARYRETGQPV IIGGSMETGS YLLAGDPGGG DTFFTTAHGS GRTMSRHQAK
KLIKGQKLQR DMEERGIYVR TASWGGLAEE AGQAYKNIDD VAEATELSHL SRRVARLVPI
GNVKG