Gene GSU0617 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU0617 
Symbol 
ID2685411 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp652996 
End bp654081 
Gene Length1086 bp 
Protein Length361 aa 
Translation table11 
GC content58% 
IMG OID637125284 
ProductNHL repeat-containing protein 
Protein accessionNP_951675 
Protein GI39995724 
COG category[S] Function unknown 
COG ID[COG3391] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0567002 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACGTTC TGAAGCTCAT CCGCAGGTTC GTCGGTGCGG CCGCCCTGTG CGCCCTGTGC 
GCCGGTTGCG CAGGACAACA GGTCCGGGAG GAGCGGCGCT ACTTCTGGCC GCCGCTGCCC
GAGCGTCCCA GGATCGAATG GCTCGGTGCC TACAGCAGCC AGAACGACTT CCCGAAGCAG
GGATTCGCGT CGTTCATGGC AGCCATTGCC GGAGAAGAAC AGGCCATGAG CCTGACCAAG
CCGCTGGATG TCTATGCGGA TGGCCAGGAC CGGATTTATG TGGCAGATCC GGGACTTCGC
GGCGTGGTTG TGTTCAATAT GAAAGAGCGG AGCGTGTCGA TGCTCGGCGG ACCCCAGGCG
GCTAACCAGT TTAATACCCC GGTTTCGGTC ACCGGTGATT CCCAGGGGAA TATTTATGTT
TCCGATGCGG AAAAGGGTGG GATACTGATT TTTGACAGAT TTGAGGTGCC GCGTCGTTTT
ATCGACACCA AAGCTGCTGT CAAGAGAAAC ACTGACATCG CCGTGGATGA AAAGGGTCAG
AGAATTCTCG TGGTGGATGC GCGCGAGCAC CGGATTGCCA TCCTCGACAT GCAGGGGGGG
CTGCTTTCCG CCTTTGGGAA GCGTGGCATC GAAGACGGCG AATTCAACTT CCCCGTGGCG
GTGGCCATCA ATCACAAGGG GGAGATTATC GTGGGCGATG CCATGAACGC CCGGGTTCAG
ATCTTCGATC AGGACGGGAA GTTTCTCCGC AAGTTCGGGC GCCGTGGCGA CGGGCCGGCT
GATTTCCAGA TCATGAAAGG CGTGGCCGTC GACTCGGAGG ATCATATCTA TGTGACCGAG
GGAAAAGGTC ACAAGCTCAT TATCTTCGGC ACCAACGGCG AGTATCTTCT CACCGTGGGC
GGACTCTACT CCGCCATCAC CACCGGCAAG CAGGCCCCCG GCGGATTCGT CATCCCGCAA
GGAGTCTTTA TTGACGATAA GGACGTCATT TACGTGGTTG ACCAACTCAA TCGGCGGTTC
CAGGTGTTCC AGTACATCTC CGACGATTTC CTCAAGCGCA ACCCCATCCC GGGATGGCAG
GAGTAA
 
Protein sequence
MNVLKLIRRF VGAAALCALC AGCAGQQVRE ERRYFWPPLP ERPRIEWLGA YSSQNDFPKQ 
GFASFMAAIA GEEQAMSLTK PLDVYADGQD RIYVADPGLR GVVVFNMKER SVSMLGGPQA
ANQFNTPVSV TGDSQGNIYV SDAEKGGILI FDRFEVPRRF IDTKAAVKRN TDIAVDEKGQ
RILVVDAREH RIAILDMQGG LLSAFGKRGI EDGEFNFPVA VAINHKGEII VGDAMNARVQ
IFDQDGKFLR KFGRRGDGPA DFQIMKGVAV DSEDHIYVTE GKGHKLIIFG TNGEYLLTVG
GLYSAITTGK QAPGGFVIPQ GVFIDDKDVI YVVDQLNRRF QVFQYISDDF LKRNPIPGWQ
E