Gene GSU0989 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU0989 
Symbol 
ID2685663 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp1065932 
End bp1067824 
Gene Length1893 bp 
Protein Length630 aa 
Translation table11 
GC content67% 
IMG OID637125659 
ProductNHL repeat-containing protein 
Protein accessionNP_952043 
Protein GI39996092 
COG category[S] Function unknown 
COG ID[COG3391] Uncharacterized conserved protein 
TIGRFAM ID[TIGR02242] phage tail protein domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTGCACG GCCTCGACCG CCGTCTGGGG CTCGCGGGAC GCATCCGCCC CGACGCCCTG 
GTGGCTGATC GCTGCGGAAC GCTGATCATG CTGGCCGGCG AGCGCTTCTA TCGTCTGGAC
CCCATGTCCG GCCGACTGGA GCGGATACCC TGTCTCGGCG GCAGGGGGGA CAGGGCGGGG
GAACTGAGTG GACCCCGGGC CATGGCTCTG GGGAGCCGCA ATCTCTACGT GGCCGATACG
GACAACAACC GGGTCTGCGT CTTTGCCACG GTTAACTGGC AGGTGCGCCG GTTTATCGGA
GCCGAGAACC CGGCAGGGGA ACCGGCGGCC GGAACCGGTC CGGGGGAGTT CGACCGCCCC
CTGGATCTGG CGGTGGACCC CTGCGACAAC CTCTATGTGC TCGATGCCGG CAACCGGCGC
ATCCAGCGTT TCGATTACCA TGGCGAACCC GTGCCCCATG TGCCCCCCTT CGGCGCCGAC
CGGCTGAAGC AGCCGGTGGC GCTGGCGCTG GGCCCGGCTC CCTCCCCGTC CGGCGGGGGG
GCCCTGGTCC ACTGCCTCGA TACGGGGCTC ACCGCCATCG TTACCTTTGA CGACCAGGGC
CGGTTCCTGG GCACCGTCGG CCTGGACGAC CTCGGCTTCG AGCCTGCCGG TCTGGCGGTG
GACGCCGACG GCAAATGGTA CGTCTCTGAT CGGGAGCGGT TCATCTATGC CATCAGGTCG
GCAGGTGACT GGAGCCCTCT GGAGGAGTAC GAGGGGAAGG CCCTGCGGCT GTTCGCCGGT
CCCGGCGGGG AGTTTTACGC CCTGGAAGAT GGGGAGGTGG CCCGCCTCAC CCGTCGCCGT
CGCTATCCGC CGGCAGGCTC GTGGACCGGC AGCGGCCCGG TAACGGGCAT CTACACTTCC
CGCTCATTCG ATACCGGCGA CGGCCGCCTC TTCTGGCACC GGGTGACGCT GGACGCGACG
GTCCCCCCAA AGACCCAGGT GCGACTTTCC TACTTCATCT ACGAAACCGG CCGCGATCCG
GAGCTCCTGC CGGCCGACGG CGAGTGGCGA AGTTTCCCGT CCAATCCGGC CGACGCCCTT
TTTGAGCGGA AGGAGGGACG GTATCTGAGG GTGCGCCTGG AACTGATCTC CGAAGACCGG
CATGCCACCC CCACAGTGGT GAGCCTCCGC CTCCAGTTTC CCAAACAATC GTATCTCCGC
TACCTTCCGG CCGTCTTCCA GGACGACGAG CGGGGGCGTG ATTTCCTGGA GCGGTTTCTT
TCCCTGTTCG AGAGCGTCCT TTACGACCTG GAGCGCGAAA TATTCACGAC CCGGCGCTAT
GCCGACCCCT GTGCCGTACC GGCCGGCTTC CTTCCCTGGC TCGCCTCGTG GCTGGCGCTC
CCCGATGCGG ACCAGTGGCT CGAAGATGGC GGGGCACGGC TGCGCACACT GATCGCCCGG
GCCAATGAGC TGTACCGTTG TCGGGGCACC CGGGGTGGAC TGGCGGAACT CATCACTCTT
TACACCGGCA AGGAACCGTG GATTGTGGAG GCGTTCCAGT TGGACCGCAT CCGGGGCCGG
AGCGAGTGGC GCGAAACCAT GAGGCTCTTC GGGGAAGATC CCTACCACTT CACGGTTCTC
CTGCCGCCGG GCGGCGCAGG GAGGACCGAA ACCGTAAAGC GGATCGTGGC GCGCGAGCGC
CCGGCCCACA CCTGTGCCAC GGTAGTTGCC CTGGAGAACC TGTTCCGGCT TGGGGGGCAC
ACCTACCTGG AAGTCAATAC CAACCTGAAC CAGCCCCTCT TTGCCCTGGA GACCTCCTCG
TCGCTCGCCC GGCAGACCTA TCTGGCCGAC GGCGAGAAGG CGGGGCAGGC GCAGGTCCGG
GCGCGACAGG GCATGGATAC ATTGTTTGAG TGA
 
Protein sequence
MVHGLDRRLG LAGRIRPDAL VADRCGTLIM LAGERFYRLD PMSGRLERIP CLGGRGDRAG 
ELSGPRAMAL GSRNLYVADT DNNRVCVFAT VNWQVRRFIG AENPAGEPAA GTGPGEFDRP
LDLAVDPCDN LYVLDAGNRR IQRFDYHGEP VPHVPPFGAD RLKQPVALAL GPAPSPSGGG
ALVHCLDTGL TAIVTFDDQG RFLGTVGLDD LGFEPAGLAV DADGKWYVSD RERFIYAIRS
AGDWSPLEEY EGKALRLFAG PGGEFYALED GEVARLTRRR RYPPAGSWTG SGPVTGIYTS
RSFDTGDGRL FWHRVTLDAT VPPKTQVRLS YFIYETGRDP ELLPADGEWR SFPSNPADAL
FERKEGRYLR VRLELISEDR HATPTVVSLR LQFPKQSYLR YLPAVFQDDE RGRDFLERFL
SLFESVLYDL EREIFTTRRY ADPCAVPAGF LPWLASWLAL PDADQWLEDG GARLRTLIAR
ANELYRCRGT RGGLAELITL YTGKEPWIVE AFQLDRIRGR SEWRETMRLF GEDPYHFTVL
LPPGGAGRTE TVKRIVARER PAHTCATVVA LENLFRLGGH TYLEVNTNLN QPLFALETSS
SLARQTYLAD GEKAGQAQVR ARQGMDTLFE