Gene GSU1788 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU1788 
Symbol 
ID2686481 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp1951381 
End bp1952445 
Gene Length1065 bp 
Protein Length354 aa 
Translation table11 
GC content51% 
IMG OID637126468 
ProductNHL repeat-containing protein 
Protein accessionNP_952838 
Protein GI39996887 
COG category[S] Function unknown 
COG ID[COG3391] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.565326 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGACTAA GCCGTGGCGT ATTCCTTCTT CTATTGTTGC TCGCGCTAGC ATCACCATTG 
ATGATCGTTG GGTGCGGCGG TCCGTCGTTC TTGCCCGCCG CCTCATTGCG AGACCCATCG
GTGGATATGG CTTGGCCTCC TGCTCCTAAT CCGGCCCGGA TACGATTTCT TCGTGAAATT
TCCGGGCCGG AGCAGGTTAA GGCTGAACCA GGAGCAATAG CCCGTTTTCT TGAGTTTGTT
ACCGGCGAGC AGTTCAAGCA TGTTCCCTTT GTAACTCCCT ACGGGGTTGT CTCGGATGGC
GGAACGCTTT TGTTTGTTTC CGATTCCTCG TCTGGTGTCG TTCATCGCAT CGATCTGGCG
CGGCAAAAGG TTTCCTATAT TGTTCGGGCT GGCGATGAGT TCCTCTCAAG CCCGGTCGGA
CTCGCCCTCT CCCCTTCAGG TGATCTGTAC GTCAGCGATT CGGTCAATGC CAAAGTGTAC
GTTTTTTCCC GTGACGGAGA GTTTTTGCGT GTCCTGGCTG ATGGCCAGGT CGACTTCAAG
AGACCGGCCG GTTTGGCCGT GAACAGTAAA GGCGTTCTTT TTGTTGTTGA TGTGTTGGCA
CATAAATTGA AAGTTTTTAA CGTGAGTGGG CGTTTCTTGG GAGATTTCCC CCCTGATGAT
ATTGGGGGTA AATTAAACCT TCCCTCCCAT GTGGCCGTTG ATAAGGACGA TAAAGTCTAT
GTTACCGATG CCTTGAATTT TACGGTCAAG GTGTATGATT CAGCCCGTCG CTATCTCCGA
AGTATCGGTG AAATAGGAGA TGCTCCCGGT TCTTTCGCGA GGCCCCGTGG CGTTGCAGTC
GACAGTGACC TCAATGTGTA CGTGATCGAT GCCGCGTTTG ACAATTTTCA GATTTTTAAT
CAGGAGGGGC AATTGCTCCT TTTCGTTGGA AAACCCGGCA AGAAAAGCGG TGAGTTTTAC
ATGCCGAGCG GCATTCATAT CGATCGCAAC GATCGAATCT TCATCTCTGA TTCGTACAAC
CGGCGGGTCC AGGTATTCGA ATACCTGAAA GAGGAAAATC GATGA
 
Protein sequence
MRLSRGVFLL LLLLALASPL MIVGCGGPSF LPAASLRDPS VDMAWPPAPN PARIRFLREI 
SGPEQVKAEP GAIARFLEFV TGEQFKHVPF VTPYGVVSDG GTLLFVSDSS SGVVHRIDLA
RQKVSYIVRA GDEFLSSPVG LALSPSGDLY VSDSVNAKVY VFSRDGEFLR VLADGQVDFK
RPAGLAVNSK GVLFVVDVLA HKLKVFNVSG RFLGDFPPDD IGGKLNLPSH VAVDKDDKVY
VTDALNFTVK VYDSARRYLR SIGEIGDAPG SFARPRGVAV DSDLNVYVID AAFDNFQIFN
QEGQLLLFVG KPGKKSGEFY MPSGIHIDRN DRIFISDSYN RRVQVFEYLK EENR