Gene Sbal223_2069 details


Gene Information

Locus tag: Sbal223_2069
Symbol:
ID: 7088362
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella baltica OS223
Kingdom: Bacteria
Replicon accession: NC_011663
Strand:
Start bp: 2450084
End bp: 2451244
Gene Length: 1161 bp
Protein Length: 386 aa
Translation table: 11
GC content: 47%
IMG OID: 643460972
Product: tetratricopeptide repeat protein
Protein accession: YP_002357996
Protein GI: 217973245
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2956] Predicted N-acetylglucosaminyl transferase
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000265574
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00000000000285968
Fosmid Hitchhiker: No
Fosmid clonability: unclonable
 

Sequence

Gene sequence
ATGCTTGAGA TACTCTTTCT GTTGCTTCCC ATTGCTGCCG CCTACGGTTG GTATATGGGG 
CGACGGAGCA TAAGGCAGAA TCAAAGCAAT CAACGTAAAC AATTGAGCCG GGACTATTTT
ACTGGCCTCA ATTTCTTACT GTCGAACGAA TCGGATAAAG CGGTTGATTT GTTTATCAGT
ATGCTAGATG TTGACGATGA AACCATAGAT ACCCATCTGT CACTTGGTTC GCTGTTCCGC
AAGCGAGGCG AGGTCGACCG CTCCATTCGT ATCCATCAGA ATTTAATTGC CCGTCCAAAC
CTAGCCAATG AGCAGCGCGA CATCGCTATG ATGGAGCTGG GTAAAGATTA TCTGGCCGCG
GGTTTTTACG ATAGGGCCGA AGAAATCTTT ATCAACTTGG TTAGCCAAGA CGATCACAGT
GAAGAGTCAG AAACTCAGCT CATTGCTATC TATCAAGTGA TTAAAGAGTG GCAAAAAGCC
ATTGATATCA CTAAGCGTTT GAGTCGTAAA CGTCAGCAAG TCTTAAAGCC GTTAACGGCG
CATTTCTATT GCCAGTTAGC CGATGAAGCC AGCGATGATG CGCAGAAAAT TAAACTGCTG
CAACAAGCGC TAAAGCAAGA TCCGCAATGC GGTCGTGCGT TATTGACCTT AGCGAAAAAA
TTCCTCGATA TTCAAGATTA TGCTCAGTGC AAGCAAATGC TGCTGCAACT GAAAAAAGCC
GATATCGAGC TTTTTGCCGA TGCAATCCCC ACGGCCAAAC AAGTTTATCG CGACACACAA
GACAAAGAAG GCTTCCAAGA GTTACTGGCG GGCGCTATGG CCGATGGCGC GGGTGCTTCA
GTGGTTGTCG CCTTAGCGCA GCACATGATA AGTCTGGATG AGATTAAAGC GGCAGAGACT
ATGGTGCTCG ATGCCCTATA TCGCCATCCA ACCATGAAAG GTTTTCAGCA CTTGATGCAA
ATGCATTTGC GTCAAGCAGA GGAAGGGCAA GCCAAGCAAA GTTTAACTAT GCTAGAGCAG
CTCGTTGAGC AACAAATTAA ATTCCGTCCA AGCTACCGTT GTAAAGAGTG TGGTTTCCCT
TCACATGCAC TTTACTGGCA TTGTCCTTCC TGTAAAAACT GGGGCAGTAT CAAGCGGATC
AGAGGCTTAG ACGGCGAGTA A
 
Protein sequence
MLEILFLLLP IAAAYGWYMG RRSIRQNQSN QRKQLSRDYF TGLNFLLSNE SDKAVDLFIS 
MLDVDDETID THLSLGSLFR KRGEVDRSIR IHQNLIARPN LANEQRDIAM MELGKDYLAA
GFYDRAEEIF INLVSQDDHS EESETQLIAI YQVIKEWQKA IDITKRLSRK RQQVLKPLTA
HFYCQLADEA SDDAQKIKLL QQALKQDPQC GRALLTLAKK FLDIQDYAQC KQMLLQLKKA
DIELFADAIP TAKQVYRDTQ DKEGFQELLA GAMADGAGAS VVVALAQHMI SLDEIKAAET
MVLDALYRHP TMKGFQHLMQ MHLRQAEEGQ AKQSLTMLEQ LVEQQIKFRP SYRCKECGFP
SHALYWHCPS CKNWGSIKRI RGLDGE
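
The lengths and sequences in this record can be cross-checked against each other. The following is a minimal Python sketch, not part of the original record, assuming inclusive 1-based coordinates and the standard amino acid assignments that translation table 11 shares with the standard genetic code. The helper names `gene_length`, `protein_length`, and `translate` are hypothetical and chosen for illustration only; alternative start codons (which table 11 permits) are not handled, since this gene starts with ATG.

```python
# Sketch: cross-check the record's coordinates, lengths, and sequences.
# Assumptions: inclusive 1-based coordinates; table 11 codon assignments
# equal to the standard code; only an ATG start is handled.

BASES = "TCAG"
# Amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((a, b, c) for a in BASES for b in BASES for c in BASES), AMINO
    )
}


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon."""
    return gene_bp // 3 - 1


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop."""
    seq = "".join(dna.split()).upper()  # drop the 10-base block spacing
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    length = gene_length(2450084, 2451244)
    print(length, protein_length(length))
    # First and last blocks of the gene sequence listed above.
    print(translate("ATGCTTGAGA TACTCTTTCT GTTGCTTCCC"))
    print(translate("AGAGGCTTAG ACGGCGAGTA A"))
```

With the values from this record, the coordinates give the stated 1161 bp and 386 aa, and the translated fragments agree with the first and last residues of the protein sequence listed above.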