Gene GSU3086 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU3086 
Symbol 
ID2688476 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp3391779 
End bp3392954 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content64% 
IMG OID637127779 
Producthypothetical protein 
Protein accessionNP_954127 
Protein GI39998176 
COG category[L] Replication, recombination and repair 
COG ID[COG0116] Predicted N6-adenine-specific DNA methylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGTCGG TTATTGTGCG CAAACGTACC GCAGTCACTC CCCTTTCGGG CGAGCATGCC 
TTCTTCGCCA CCACAGCCAA GGGGGTTGAG GATGTCCTGG CCGCCGAGAT GCGCGGCCTC
GGTTTTCGCG GGGTGGCCAT AGAGCGCGGA GGAGTGCGTT TCGCGGGCGA TCTTGCCGCC
TGCTACCGGG CCAACCTCTG GCTGCGGACC GCGAGCCGCA TCCTCGTGCC GCTTGCCCAA
TTCCCCTGCG ATTCACCCCA GCGGCTATAT GACGGTGTCC GCTCCATCCA GTGGAACGAC
TGGCTCACCC CCGACATGAC CCTGGCAGTT GAGTGCAATC TTCGCGATTC GGTTCTGACC
CATTCAGGGT TCGTGGCTCT CAAGACCAAG GACGCTATAG TCGACACCAT CCGTGACCGC
CACGGCAGGC GACCCAGCGT GAATCCCAAG CAGCCGGATC TCGGGGTCCA TGTTCATCTG
GTCCGTAACG TCTGCACCGT GAGCCTTGAC AGTTCCGGCG CCAGCCTCGA CCGGCGCGGC
TACCGTACCG AGGCGGGTGA GGCCCCGTTG CGGGAAACTC TTGCCGCCGC CCTCGTGGAG
ATGACGGGGT GGGACGGCAC CGTTCCTCTC CTAGACCCCA TGTGCGGCTC GGGTACCATC
CTCGTGGAGG CGGCGCTCAA GGCGCTCAAT CGGGCGCCCG GTCTCATTCG GGAGCGTTTC
GGATTCCAGC ACTGGCCCAG CTTCGACTCT TCACTCTGGC TCCGTCTTGT AACCGAAGCA
CGCCAGGGGG AGCGCACTTC ACTTGAGTCC CCCCTCCTGG GGAGCGATCA GCAGGCTGAC
CTTCTCTCTG TCGCCGCAGC CAATGCCCGG CGGGCCGGTG TCGAGCAGCA CATTTCCTTT
ACGTCTGGCG ATGTGCGTGG CCTGACACCA CCTCCCGCTC CTGGCATTAT TCTCTTCAAC
CCTCCTTATG GCAGGAGGCT CGGCGATGAA GAAGGGCTTC GGGTCCTCTA CCGCCAGATC
GGTGACGTTC TGAAGCAGCG CTGCGCCGGC TATACCGCCT GGCTGCTCAC CGGTGGGCCC
GAGTTGGCCA AGGCGGTGGG GCTCAGGGCA TCCCGGCGGA TCGTCCTCTT CAACGGCCCC
CTTGAGTGCC GTTTCCTGAG GTTTGATCTC TACTGA
 
Protein sequence
MSSVIVRKRT AVTPLSGEHA FFATTAKGVE DVLAAEMRGL GFRGVAIERG GVRFAGDLAA 
CYRANLWLRT ASRILVPLAQ FPCDSPQRLY DGVRSIQWND WLTPDMTLAV ECNLRDSVLT
HSGFVALKTK DAIVDTIRDR HGRRPSVNPK QPDLGVHVHL VRNVCTVSLD SSGASLDRRG
YRTEAGEAPL RETLAAALVE MTGWDGTVPL LDPMCGSGTI LVEAALKALN RAPGLIRERF
GFQHWPSFDS SLWLRLVTEA RQGERTSLES PLLGSDQQAD LLSVAAANAR RAGVEQHISF
TSGDVRGLTP PPAPGIILFN PPYGRRLGDE EGLRVLYRQI GDVLKQRCAG YTAWLLTGGP
ELAKAVGLRA SRRIVLFNGP LECRFLRFDL Y