Gene Spea_3920 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSpea_3920 
Symbol 
ID5664304 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella pealeana ATCC 700345 
KingdomBacteria 
Replicon accessionNC_009901 
Strand
Start bp4768626 
End bp4769636 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content48% 
IMG OID641238584 
ProductBNR repeat-containing glycosyl hydrolase 
Protein accessionYP_001503765 
Protein GI157963731 
COG category[R] General function prediction only 
COG ID[COG4447] Uncharacterized protein related to plant photosystem II stability/assembly factor 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGATAT TCATGCTGTT AATAAGCTTA TTTAGTTTTG CTACTGTGGC GATGGAATGG 
AATATTCAAA GCCTAGCCCC CGGTGTTTCA TTTCGAGGTA GCGCAGTTTT GGATGGCGTG
GTTTGGGTGA CTGGGACAGA TAATCGCGTC TATATCTCTA AAGATTCAGG TAACAGCTGG
CAAGATGTAT CGGTAAAGGG GCTGCCTTTG ACTGACTTTC GCGATATTGA AGTGTTCGAT
GCCAATACGG CGATTGTGAT GGGAGCGGGT GAAGGTGGCT TATCTAAGCT GTATATCACT
CAGAACCAGG GTCTAAGCTG GCAACTCTTG TTTGATAATC CCGATGAGCT TGGCTTTTTC
AACTCAATTG CCTTTTGGGA TCGTAATAAC GGGTTACTGC TGGGAGATCC TGTTGGTGGT
TGCTATGTGA TATTACGTAC CTCTGATGGT GGTAAAAGCT GGCAACGAGT TGCCCAAGGT
GAGTTGCCTG AAATGCTCGA TAAAGAGGTC GCCTTTGCCG CCAGTGGTAA TACATTAATT
GTTGGCAAAA GAGGGCGGGC TTGGTTTACC ACCGGAGGCT ATAGTAGCTC AGCTTATAGT
AGCGCTGACT CGGGGCAGCA CTGGCGCCGG CGTCCAATTG CTCTCTATGA TGATACCCAA
ACCGCAGGTG GCTATGCGCT AGCATTTAAT CATTTAGGTG ATCTATTTGT GCTGGGAGGA
GATTATCAGC AACGAGATAA GTTTGATGCC AATATGGTCT ACAGGAAAGG CTATGTATGG
CATAAAGCGC CGGGTACCAC ACCAGGCCTA CGCACAGCCA TGGCGTGTTA CCAAGAGATT
TGCATCGCGA CCGGTAAGCT ATCGTCAGAT ATTTCAATCG ATCATGGTTA CAGTTGGCAG
CCACTGTTGA TCAATGGTCA GGCTCAGGGC TTTTTTACGC TCGCGATTGA CGGTAACACC
TTAGTGGCTG GTGGGCATGA TGGCCGAGTC GCCGTGTATA CTTTTGAATA G
 
Protein sequence
MKIFMLLISL FSFATVAMEW NIQSLAPGVS FRGSAVLDGV VWVTGTDNRV YISKDSGNSW 
QDVSVKGLPL TDFRDIEVFD ANTAIVMGAG EGGLSKLYIT QNQGLSWQLL FDNPDELGFF
NSIAFWDRNN GLLLGDPVGG CYVILRTSDG GKSWQRVAQG ELPEMLDKEV AFAASGNTLI
VGKRGRAWFT TGGYSSSAYS SADSGQHWRR RPIALYDDTQ TAGGYALAFN HLGDLFVLGG
DYQQRDKFDA NMVYRKGYVW HKAPGTTPGL RTAMACYQEI CIATGKLSSD ISIDHGYSWQ
PLLINGQAQG FFTLAIDGNT LVAGGHDGRV AVYTFE