Gene GSU3222 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU3222 
Symbol 
ID2688295 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp3533249 
End bp3534487 
Gene Length1239 bp 
Protein Length412 aa 
Translation table11 
GC content61% 
IMG OID637127915 
ProductNHL repeat-containing protein 
Protein accessionNP_954263 
Protein GI39998312 
COG category[S] Function unknown 
COG ID[COG3391] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGCCGC GTTTCGCCGT AGCCCACCGT CCTTTCACAC AAGCTGCCCG GCTGTTCCTG 
CTCTGTTCGC TGCTCTGGAT CAGCGGATGT GCCGGTAAAA CCGGCACTGC GGGAAAGACC
TTTTTTCCGC CGCCTCCCAA CCTGCCCCGG CTCCAGTACC TGATGGGGAT TGCCAACTCG
ACCGATGTGG AAGGGAAGGA TTCATCGTTT TCCCTCTTCG GGGGATTGGC CGAGCAACGC
GAAAAGATCC GCTACATCGT GAAGCCGTAC GGCATTACCG AGGCCGGCGG CAAGCTCTAC
GTGAGCGATG TGGGAACCGC CCAGATCGTC GTCATCGACC TGCCGGGCAA AAAATTCGAG
CTGCTCAAGG GGGCTGCCGG ACCTGGCAAG CTGACCACTC CGGCCAACGT GGCCGTGGAC
AAGGACGGCT TCATCTACGT GGCCGACGCG GGCCGGAGAG AGGTGGTGGT ATTCACGCCG
GAAGGCGATT TCCTCAAGGC CATCGGCGGG GACCGGGACA TGAAGCCCGT GGATGTGGTC
GTTAGCGGCG ACCGGGCCTT CGTGCTCGAC ATCAAAAGCA GCGATATCAA AGTGTTCAAC
GTCAAGAGCG GCCAGTATCT CGAAAGTTTC GGCACAGCGG GCGGCCCCTT CGAGCGGCTC
GCCATGCCCA TCAACCTGGC CATGGACTCC AAGGGGTTTC TCTATGCCAC CAACGGGGTC
AGCGGCAGAG TCCTCAAATT CGACCGGGAC GGCAACCTGC TGCTCTCCTT CGGCCAGATG
GGTGATGGCT TCGGCCAGTT CGCCCGCCCC AAGGGAATCG CGGTGGATCC CACCGGACTG
ATCCACGTGG TTGACGGCGG ACATCAGAAC GTTCAGCTCT TTTCGGATAC GGGACGCCTG
CTCCTCTTCT ACGGCGACGC GGGCAAGGAT AGCACCGCAT CACTGAATCT CCCGGCAGGG
ATCGCCTATT CCACGGCCAA CCTGGAGTAC TTCCAGAAAA TGGCGGACTC TTCCTTCAAG
CTGGACGGCG TGGTGTTCGT CACCAACCAG GGGGGGAAAG CGAACAAGGT GGCCGTGTAC
GGGTACGGCA AACGCGAAGG GATCGATTAC GAGCAGGAGT ACGAGAAAAT CCGCAAGGAA
CTGGAAGAGC GTGCACGGAA AGCCCGCGAA AAGGAGGCCC AGGAGGGGAA GAAGGCTGGC
CAGGCCGAAC CCAAGGCTGC GGAACCCGCC GCGAAGTAG
 
Protein sequence
MKPRFAVAHR PFTQAARLFL LCSLLWISGC AGKTGTAGKT FFPPPPNLPR LQYLMGIANS 
TDVEGKDSSF SLFGGLAEQR EKIRYIVKPY GITEAGGKLY VSDVGTAQIV VIDLPGKKFE
LLKGAAGPGK LTTPANVAVD KDGFIYVADA GRREVVVFTP EGDFLKAIGG DRDMKPVDVV
VSGDRAFVLD IKSSDIKVFN VKSGQYLESF GTAGGPFERL AMPINLAMDS KGFLYATNGV
SGRVLKFDRD GNLLLSFGQM GDGFGQFARP KGIAVDPTGL IHVVDGGHQN VQLFSDTGRL
LLFYGDAGKD STASLNLPAG IAYSTANLEY FQKMADSSFK LDGVVFVTNQ GGKANKVAVY
GYGKREGIDY EQEYEKIRKE LEERARKARE KEAQEGKKAG QAEPKAAEPA AK