Gene Sama_1674 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSama_1674 
Symbol 
ID4603925 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella amazonensis SB2B 
KingdomBacteria 
Replicon accessionNC_008700 
Strand
Start bp2046154 
End bp2048469 
Gene Length2316 bp 
Protein Length771 aa 
Translation table11 
GC content58% 
IMG OID639781037 
Producthydrogenase maturation protein HypF 
Protein accessionYP_927550 
Protein GI119774810 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0068] Hydrogenase maturation factor 
TIGRFAM ID[TIGR00143] [NiFe] hydrogenase maturation protein HypF 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGTAA AGATAAGCCT TCACATCACC GGCATAGTTC AGGGGGTTGG GTTTCGGCCC 
CATGTATTTC GCCTCGCCCA CGAGCTCGGG CTTGGCGGCA GTATCCGTAA CGATGCTGAA
GGCGTCTTTA TTCGTTTGAT GGGTGCCCAG CATAAAGTCG ATGACTTCAT TGAAAAGCTG
CGCAGTGCCC CGCCGCCACT GGCGCGGATA GACGGTTTCA CCCTGATAGC CGATGACGGT
TCTGATCTTG ACGAAACGCG CTTCGTGATT GAAGAAAGCC AGGCCGGCGG CGAGGCTCAG
GTTGTGGTAT CACCGGATAA AAGCATGTGT CCGGACTGCC TGGCTGATAT CCGCAACCCA
AGCGACAGGC ACTACCGCTA CCCCTTTACC AACTGCACCA ACTGCGGCCC CCGCTACACC
CTGATAAAAG CCTTGCCCTA CGACAGAAAA CACACCTCGA TGGCGCACTT TGCCATGTGC
GCTCAGTGTG AAGCCGAGTA TAAAAATCCG CTGGACCGAC GCTATCACGC CCAGCCGGTG
AGTTGTCCAA ACTGCGGGCC CAAGCTCAGT CTCACCGGTC CGGACGGCAG CCTTATCAGC
AGCGATGCAG AGTTCTGTCT GGATGAGTGT GCGCGACTGT TAAAACAGGG CGCCATATTC
GCTATCAAAG GGCTTGGCGG CTTTCATCTG GTGTGTGATG CCACCCATGA GGAGGCCGTA
AGCAGCCTGC GAACGCGTAA AGTTCGCCCG GCTAAGCCCC TCGCAGTCAT GGTCAAAGAT
ATTGCCATGG CCAGAAGCTT CGCCGATGGC GCACAGGAGG AATGGCAACT GCTCTCTTCC
CGGGAGCGCC CCATCGTCAT TATGAAAAAA GCGGCTGAGC AGCATCGCGC AGTGCCAAAA
GATACCCAGC ACCGCGAACT TGCCGCCTCG GTGGCACCCG GCATCGACCG GATTGGACTT
TTTTTACCCT ACACACCACT GCATGCCTTG CTGCTGGACA GTGTTGACCG CCCGCTGGTC
GCCACCAGCG CCAACCGCAG TGGTGAGCCA ATCATCATAG CGGCTGTAGA TATCCACCAA
AAACTCGCAG GCGTGGTGGA CTTTATTCTC GACCACGATC GCCCCATCGT ATCGGGGTGT
GATGACAGCG TGGTCCAATG GTGTGCCGGA CAGCTCCAGG TTATCCGTCT CGCCAGAGGC
TATGCGCCGC TGGCAATGCT CAGCCCTAAA CCCAGCAAGT CCCATCTGAT GGCGGTGGGG
CCTCAGCAAA AAAATACCCT GGGATTTGGC CTTGGCCATA ACCTCTTCTT AAGCCCCCAT
ATCGGCGATC TCTTCAGTAT TGAGGCCGAA GATTATTTTA TTCGAACACT GGAGAGTTTT
AAGCGCCTCT ATCAGGTAGA GCCTGGCTGC GTGGTAGCCG ACCATCACCC GGATTATGCC
CCGAGCCGCT TTGCCCGGCG CTATGCCCAA AGTGGCGATC GCGCCATTGA ACTGCTCACT
GTGCAGCACC ATTTTGCCCA CATATTGTCG GTGATGGCGG CCAATTCCCG CAGCGAACCC
GTTCTGGGAT TCAGTTTTGA TGGCACGGGT CTCGGTGACG ATATGAGCCT GTGGGGCGGC
GAAGTGCTGC TGTGCAGCGC AACCCACTTC GAGCGTCTCG GAGGACTGAT GCCCTTTGCC
CTGCCCGGCG GGGACAAGGC CAGCAGCGAA CCCTGGCGGG TGCTGTTAAG CCTGCTCGCG
GGTCATCTTT CGGCCTCACA GCTTCAGGGA CTTCCCTGTT TTGCCCATTT AAGTCAGCCC
ATGCTGAACA ACCACCTCAA AGTCATAGGG CGTCCCGGCA CCTTGCAAAG CTCATCCATG
GGAAGGCTCT TTGATGCGGC CGCCGCCTTG CTGGGTATCT GCCAACGAGT CGAGTTTGAA
GGCCAGGCAG GCATGTTACT GGAAGCCTTT GCCGCCCGGG CAACCGGCCC CATCCCCACT
CTGGTTCTGG AAGATAAACA GGGCCAGTGG GACGGTGCAG CGCTGCTGGT GTCTATGCTG
GAGCAGATGG ACACGTTGGA TACCTCCCCA GAATGCCTGG CGAACGCTTT TATCAATGCC
ATCGCCGATG CCATCGCCGA GAAAGCGCGT CATTATCCCA ATTTGCCCGT GGCACTGAGC
GGCGGCGTGT TCCAAAGCCG CACTCTCGCC AATGCCACTG CCCAGAGGCT TTCACAACAA
GGGCTTAAGC TGCTGCCATG GGGGCAGGTC CCGGTAAATG ATGGTGGTAT CGCCCTCGGC
CAACTTTGGT ACGCAGTTCA CAGTTTTGGT CAATAA
 
Protein sequence
MSVKISLHIT GIVQGVGFRP HVFRLAHELG LGGSIRNDAE GVFIRLMGAQ HKVDDFIEKL 
RSAPPPLARI DGFTLIADDG SDLDETRFVI EESQAGGEAQ VVVSPDKSMC PDCLADIRNP
SDRHYRYPFT NCTNCGPRYT LIKALPYDRK HTSMAHFAMC AQCEAEYKNP LDRRYHAQPV
SCPNCGPKLS LTGPDGSLIS SDAEFCLDEC ARLLKQGAIF AIKGLGGFHL VCDATHEEAV
SSLRTRKVRP AKPLAVMVKD IAMARSFADG AQEEWQLLSS RERPIVIMKK AAEQHRAVPK
DTQHRELAAS VAPGIDRIGL FLPYTPLHAL LLDSVDRPLV ATSANRSGEP IIIAAVDIHQ
KLAGVVDFIL DHDRPIVSGC DDSVVQWCAG QLQVIRLARG YAPLAMLSPK PSKSHLMAVG
PQQKNTLGFG LGHNLFLSPH IGDLFSIEAE DYFIRTLESF KRLYQVEPGC VVADHHPDYA
PSRFARRYAQ SGDRAIELLT VQHHFAHILS VMAANSRSEP VLGFSFDGTG LGDDMSLWGG
EVLLCSATHF ERLGGLMPFA LPGGDKASSE PWRVLLSLLA GHLSASQLQG LPCFAHLSQP
MLNNHLKVIG RPGTLQSSSM GRLFDAAAAL LGICQRVEFE GQAGMLLEAF AARATGPIPT
LVLEDKQGQW DGAALLVSML EQMDTLDTSP ECLANAFINA IADAIAEKAR HYPNLPVALS
GGVFQSRTLA NATAQRLSQQ GLKLLPWGQV PVNDGGIALG QLWYAVHSFG Q