Gene Spea_2000 details

Gene Information

Locus tag: Spea_2000
Symbol:
ID: 5662393
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella pealeana ATCC 700345
Kingdom: Bacteria
Replicon accession: NC_009901
Strand:
Start bp: 2423747
End bp: 2424892
Gene Length: 1146 bp
Protein Length: 381 aa
Translation table: 11
GC content: 46%
IMG OID: 641236595
Product: cupin 4 family protein
Protein accession: YP_001501855
Protein GI: 157961821
COG category: [S] Function unknown
COG ID: [COG2850] Uncharacterized conserved protein
TIGRFAM ID:
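
The coordinates and lengths listed above are consistent with one another, and a short calculation shows how. The following is a minimal Python sketch, not part of the database record. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are illustrative only.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(2423747, 2424892)
print(length)                  # gene length in bp
print(protein_length(length))  # protein length in aa
```

With the values from this record, the two results match the listed Gene Length (1146 bp) and Protein Length (381 aa).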


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAATTAG ATATCAATGG TTTAACACCA GAGCAGTTTC TCAAAGAATA CTGGCAAAAA 
AAACCACTAG TGATCCGTCA AGGATTTAAA AACTTTCAAG ATCTGCTCTC ACCTGACGAG
ATGGCAGGGC TGGCTTGTGA TGAAATGGTG GAGTCTCGTC GAGTTTATCG AGAGAAGGGA
GACTGGCAGG CGGAGTTTGG GCCATTTGAA TCTTATGAGC ATTTAGGCGA GAAGGATTGG
ACCTTAATCG TACAGGCATT AAATAATTGG GTTCCTGCAG CTGAAGACTT ACTTAAATGC
TTTGATTTTA TACCTCGCTG GCGTTTAGAT GACGTGATGG TTAGCTACGC GGTTCCCGGT
GGTGGAGTTG GGCCGCATAT TGACCTTTAT GACGTGTTTA TCTGTCAAGG TTCAGGGCGT
CGCCGATGGC GTGTAGGCGA CTTAGGTCCT CATAAAGAGT TTGCTGCGCA TCCAGCACTG
CTGCATACCG AAGCATTTGA CCCGATCATA GATGTTGAAC TCTTGCCTGG TGATATTCTC
TATCTACCGC CAGGGTATCC TCATGATGGC GTGACACTTG AGCCGTCAAT GAGCTTTTCT
GTTGGTTACC GAACAGCTTC GGCGAAGGAT ATGGTGAGTG CTTTGGCCGA TCACCTAATC
GATAACGAAC AAGGTACTAA GCAGATCACG GATCCCGATC GTGGCTTGAG CCAACATTCT
GGTTTAATCG ATGAACAAGA TCTAGGACGT ATCAAGCAGC AACTCATCGA AACGCTAGAT
GATACACTGA TCAGTGAATT CAGTGGTCGC TATCTGACTC AATCTAAGTG CGAATTAGAC
CTACCTGAAG AGCAATTAGG CTTCCAATTA GACGATATAA AGGCAATCAT CTCAGAGCAG
CCGTTAATTC GTTTGGGCGG GCTACGTTGT CTTTATTTCG CGACCAGTCT TGAGTCTGGT
GTCATGTACA TTAATGGCGA GCAAGTGGAA CTAGGAGAAG GCAGTTCAGA GGTGATTGAA
GCGCTCTGTA ATCAACAGCA GCTAACCCTC GAAGACATGT CTACTTGGTT AGAAAATGAA
GCTGTGATGA CTCAATTGAC CGACTGGGTT AATGCGGGTT ATTGGTACTT CGATGATGTA
GAGTAA
 
Protein sequence
MQLDINGLTP EQFLKEYWQK KPLVIRQGFK NFQDLLSPDE MAGLACDEMV ESRRVYREKG 
DWQAEFGPFE SYEHLGEKDW TLIVQALNNW VPAAEDLLKC FDFIPRWRLD DVMVSYAVPG
GGVGPHIDLY DVFICQGSGR RRWRVGDLGP HKEFAAHPAL LHTEAFDPII DVELLPGDIL
YLPPGYPHDG VTLEPSMSFS VGYRTASAKD MVSALADHLI DNEQGTKQIT DPDRGLSQHS
GLIDEQDLGR IKQQLIETLD DTLISEFSGR YLTQSKCELD LPEEQLGFQL DDIKAIISEQ
PLIRLGGLRC LYFATSLESG VMYINGEQVE LGEGSSEVIE ALCNQQQLTL EDMSTWLENE
AVMTQLTDWV NAGYWYFDDV E
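
The protein sequence is the translation of the gene sequence under translation table 11, whose amino acid assignments are the same as those of the standard genetic code. The sketch below is a hedged illustration, not the tool that produced this record: it strips the spacing used in the listing, translates codon by codon until the first stop codon, and computes GC content. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical, and only the first line of the gene sequence is used to keep the example short.

```python
# Codon table built in TCAG order; amino acid assignments of the standard
# code, which translation table 11 shares ('*' marks a stop codon).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def clean(seq):
    """Remove whitespace from a blocked listing and upper-case it."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate from the first base, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence as printed in this record.
first_line = "ATGCAATTAG ATATCAATGG TTTAACACCA GAGCAGTTTC TCAAAGAATA CTGGCAAAAA"
dna = clean(first_line)
print(translate(dna))  # MQLDINGLTPEQFLKEYWQK
print(gc_percent(dna))
```

The 20 residues printed for the first 60 bases match the start of the protein sequence above. Passing the complete gene sequence through `clean` and `translate` should reproduce the full protein, and `gc_percent` on the full sequence can be compared with the listed GC content.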