Gene Sbal223_2004 details


Gene Information

Locus tag: Sbal223_2004
Symbol:
ID: 7086838
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella baltica OS223
Kingdom: Bacteria
Replicon accession: NC_011663
Strand:
Start bp: 2364513
End bp: 2365670
Gene Length: 1158 bp
Protein Length: 385 aa
Translation table: 11
GC content: 50%
IMG OID: 643460907
Product: Cupin 4 family protein
Protein accession: YP_002357931
Protein GI: 217973180
COG category: [S] Function unknown
COG ID: [COG2850] Uncharacterized conserved protein
TIGRFAM ID:
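
The gene and protein lengths listed above can be cross-checked against the start and end coordinates. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one trailing stop codon is excluded."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Sbal223_2004.
length = gene_length_bp(2364513, 2365670)
print(length)                     # 1158
print(protein_length_aa(length))  # 385
```

Both results agree with the Gene Length and Protein Length fields of the record.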


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.367195
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000511749
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCAACTCG ATATTAACGG CTTAACGCCC GCGCAATTTC TAGCAGAGTA TTGGCAGAAA 
AAACCCTTGG TGATCCGCCA AGGATTCAAG CATTTTCAAG ATTTAGTTTC GCCCGAAGAA
TTAGCTGGCT TAGCCATGGA TGAGCTGGTG GAATCTCGGC GGGTGTACCA ACAAGCTGGC
CAATGGCAGG CGGAATTTGG TCCCTTTGAT TCCTACGATC ATCTCGGTGA ACGGGATTGG
ACTCTGATCG TCCAAGCTTT GAATAACTGG GTGCCCGATG CGGAGGCCTT GATCCAATGC
TTTGATTTTA TTCCGCGCTG GCGTTTAGAT GATGTGATGG TGAGCTTTGC GACTCCTGGC
GGTGGAGTAG GCCCGCATAT CGATCTGTAT GATGTGTTTA TTTGCCAAGG TTCGGGACGT
CGCCGTTGGC GTGTGGGCGA TCTGGGGCCG CACAAAGAGT TTGCCGCCCA TCCCGCTTTG
CTGCACACAG AAGCCTTTGA ACCGATTATC GATACTGAGT TGTTGCCCGG CGACATCCTT
TATATTCCCC CCGGATTCCC CCATGACGGC ATAACCTTAG AACAGTCGTT AAGTTTTTCA
GTGGGTTATC GCACCGCCTC CGCTAAAGAT ATGATAAGTG CTTTGGCCGA TCATTTGTCT
GAACAGGATT TAGGCGCACA GCAGATTGAA GATCCAGATC GCGAGCTGAC CACCCGCAGT
GGCTGTGTCG ATAATGGCGA TTTAGCGCGG CTACGTACCC AACTGACCAA TATGTTGACC
GATGAACTGG TGAGCGAGTT TTCGGGCCGC TATTTAACTC AGTCAAAATG CGCCTTAGAT
TTACCCGATG AGCCATTGGA CATCACGCAA GACGAAGTGC TCGCTTGGCT CGATGAGCAG
CCGCTTATTC GCCTCGGCGG GCTGCGCTGT TTGTATTTTG ATATCAATGT GGCACAGGGC
GTTGTCTATA TTAATGGTGA TAAGTATCAG CTTTCAGCCG AATTGGCCGC AGTGATCCCA
TTACTATGTG ATAGTAATCA GTTGGATAAA GCTGCCTTAG CCCCTTGGTT AGCCCATGCT
GATTTGCTCA CGCAACTTAC CGAGTGGGTG AATCTAGGCT ACTGGTACTT TGAAGATCTC
AGCGATGAAG AGTGTTAA
 
Protein sequence
MQLDINGLTP AQFLAEYWQK KPLVIRQGFK HFQDLVSPEE LAGLAMDELV ESRRVYQQAG 
QWQAEFGPFD SYDHLGERDW TLIVQALNNW VPDAEALIQC FDFIPRWRLD DVMVSFATPG
GGVGPHIDLY DVFICQGSGR RRWRVGDLGP HKEFAAHPAL LHTEAFEPII DTELLPGDIL
YIPPGFPHDG ITLEQSLSFS VGYRTASAKD MISALADHLS EQDLGAQQIE DPDRELTTRS
GCVDNGDLAR LRTQLTNMLT DELVSEFSGR YLTQSKCALD LPDEPLDITQ DEVLAWLDEQ
PLIRLGGLRC LYFDINVAQG VVYINGDKYQ LSAELAAVIP LLCDSNQLDK AALAPWLAHA
DLLTQLTEWV NLGYWYFEDL SDEEC
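
The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation routine. This is a sketch, not the database's own pipeline. It assumes the sequence is pasted in the 10-base blocks shown above (whitespace is stripped), and it uses the standard codon-to-amino-acid assignments, which translation table 11 shares with table 1. The two tables differ in their permitted start codons, which this sketch does not handle; this gene starts with ATG, so the difference does not matter here. The names `translate` and `gc_fraction` are hypothetical helpers.

```python
from itertools import product

# Codon assignments in TCAG order; table 11 uses the same assignments
# as the standard code (it differs only in permitted start codons).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; uppercase the rest."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at a stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a non-empty sequence."""
    dna = clean(seq)
    return sum(base in "GC" for base in dna) / len(dna)


# First 30 bases and last 18 bases of the gene sequence in this record.
print(translate("ATGCAACTCG ATATTAACGG CTTAACGCCC"))  # MQLDINGLTP
print(translate("AGCGATGAAG AGTGTTAA"))               # SDEEC
```

The two outputs match the first ten and last five residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.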