Gene Sama_2065 details


Gene Information

Locus tag: Sama_2065
Symbol:
ID: 4604315
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella amazonensis SB2B
Kingdom: Bacteria
Replicon accession: NC_008700
Strand:
Start bp: 2501148
End bp: 2502266
Gene Length: 1119 bp
Protein Length: 372 aa
Translation table: 11
GC content: 55%
IMG OID: 639781442
Product: cupin 4
Protein accession: YP_927940
Protein GI: 119775200
COG category: [S] Function unknown
COG ID: [COG2850] Uncharacterized conserved protein
TIGRFAM ID:
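
The coordinate and length fields above are related by simple arithmetic. As a minimal sketch, assuming Start bp and End bp are 1-based inclusive positions (the record does not state its convention) and that the last codon of the CDS is a stop codon, the two lengths can be cross-checked as follows. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 2501148
end_bp = 2502266

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted
# in the protein length.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1119 372
```

Both results agree with the Gene Length (1119 bp) and Protein Length (372 aa) fields.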


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.439804
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTATAAGC TCAATCTCGA TATTCAGGCG TTCCTCGCCA ATGACTGGCA ACAGGCCCCA 
AAAGTATTCA AAGGCGCTTT CCCTGACTTT GAAGATCCCA TTGCTGCAGA CGAGCTCGCG
GGTCTTGCCT GTGAAGAGGA AGTGTCCAGC CGCGTGGTTG TCACCAAGGG TAACGACTGG
GAAGTCATTT CCGGTCCCTT CGAAGACTAC GACCGGTTTG GTGAGACCCA TTGGCAGCTG
CTTGTGCAAG CCGTAAACCA CTGGTACCCC GAATCCCAGC CCCTGGTGGA AGCCTTCAGA
TTCTTGCCAG ACTGGCGCTT TGATGACCTG ATGGTGTCTT TCGCCACCCC GCAGGGCGGC
GTGGGACCTC ACATCGACAA CTACGATGTG TTTATCATTC AAGGCGAAGG CCAGCGCCGC
TGGACCGTGG GCCCCAAGGG CAACTACCAG CGCCGCGGTG GTGTAACGAC CTCACCCCTG
ATTGAAGACT TTGAGCCCAT TATCGATGTC GTGCTGGAAA AAGGTGATGT GCTTTATATC
CCGCCCGGCT TCCCTCATCA AGGTGAAACC CTGACTCTGG CACTCTCTTA TTCCATGGGG
TATCGCGCGC CCAGCCAGCA GGAGCTTGCA GGACAAATTG CCGATCAGTT GATGGATGAA
GACAAGGGGC ACAAGCGCTT TATCGCCGTG GATGGCGCCG CGAGCCATGG CACTGTGAGC
CTGGCAGAGC AGCAAGGCAT CATGCAGCTT ATCCGCGACC TTTGTAATGA CACCGATAAC
GTCGTTAAGG TCCTCGGTAA ACTCTTAAGT CAGAACCGCT TCGACCTGGA TATCCAGGAA
GATGAAAGCA TCGATGCCGA CGCCCTGGTT GAGGCTCTGA ATGAAGGGGC TGTGCTGATG
CGGATTGGTG GCCTTAAGGT GCTCAAAATG GAAGGCGACA GCCAGGCAAG GCTCTTTGTG
GCAGGCGAGA GTGTGATAAT TGAAGGCGCT TCGGAAGAAG AGCTGATCGA GGTATCCAAC
TCAGTCAACG TCAATGCTGA GCTGGCGCAG CTGCCACACT GGCAGGGCTT CTTTGTTCAG
ATGCTGCAAA AAGGCTATTT CTATCTCGGC GAAGACTGA
 
Protein sequence
MYKLNLDIQA FLANDWQQAP KVFKGAFPDF EDPIAADELA GLACEEEVSS RVVVTKGNDW 
EVISGPFEDY DRFGETHWQL LVQAVNHWYP ESQPLVEAFR FLPDWRFDDL MVSFATPQGG
VGPHIDNYDV FIIQGEGQRR WTVGPKGNYQ RRGGVTTSPL IEDFEPIIDV VLEKGDVLYI
PPGFPHQGET LTLALSYSMG YRAPSQQELA GQIADQLMDE DKGHKRFIAV DGAASHGTVS
LAEQQGIMQL IRDLCNDTDN VVKVLGKLLS QNRFDLDIQE DESIDADALV EALNEGAVLM
RIGGLKVLKM EGDSQARLFV AGESVIIEGA SEEELIEVSN SVNVNAELAQ LPHWQGFFVQ
MLQKGYFYLG ED
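
The gene and protein sequences can be checked against each other by translating the DNA. The following is a minimal sketch, not the database's own method. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in which start codons are permitted). The helper name `translate` and the choice to stop at the first stop codon are assumptions made for illustration. Only the first line of the gene sequence is translated here.

```python
# Codon table in the conventional T, C, A, G ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(blocked_dna):
    """Translate a DNA string written in space-separated blocks.

    Hypothetical helper: whitespace is removed, codons are read in
    frame from the first base, and translation stops at the first
    stop codon.
    """
    dna = "".join(blocked_dna.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence, copied from the record.
first_line = "ATGTATAAGC TCAATCTCGA TATTCAGGCG TTCCTCGCCA ATGACTGGCA ACAGGCCCCA"
print(translate(first_line))  # prints: MYKLNLDIQAFLANDWQQAP
```

The output matches the first 20 residues of the protein sequence (MYKLNLDIQA FLANDWQQAP). Passing the full gene sequence to the same function would allow the whole 372 aa product to be compared.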