Gene Sbal223_1147 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbal223_1147 
Symbol 
ID7087600 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella baltica OS223 
KingdomBacteria 
Replicon accessionNC_011663 
Strand
Start bp1349236 
End bp1351617 
Gene Length2382 bp 
Protein Length793 aa 
Translation table11 
GC content45% 
IMG OID643460058 
Producttype IV pilus assembly PilZ 
Protein accessionYP_002357085 
Protein GI217972334 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000254962 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGTTTAG ATAACCATAG TGCACTCATC GAACAGCTCA AGCCGTTACT CATGGAACCT 
AACTTCCAAG AGATTTTCCA GCAGCTGACG ATAGATGAAA CCAATTCGAC CCGTTTTCTT
GTGAAAATGG AATTGAGTCG TTTGGCTTCG CCTTGCACCC GAATTATCGA TCTCAGGGAT
AAATCAGAGC TTCCTTGTAC TGAAGTCATG TTAGGTCAAC AGCGACATTT TTTAGATGAG
CCAGCCAAAA GCAGTATGCA TGAGGCCATG TCACTGTTTC GCAACCAATA TACCTTAGGT
GTGTATGAAT ATGTGCTCAA TGCCCATCAA CAGAGGCGGG TAAAATTACG CCAAGGTGTG
ACTCAGATTG AAGCCGCCGA GCCAGAACCT TTTATGGTGC CAGGCGTAGT ATTAGGCAGT
TATTTCAACC GCGCTGAGGA GCGGATGAAT TACAGTATTC GCATTGCGGC ATCACAAACG
GGTCGCGGTG AAGTGCCCGG GATCACGGCA GATTTATCTA TTGGCGGCGC CCGTATTCGT
CTTGCAGCCA ATCATCCCTT CGATCTCGAC CAACCACTCA AAGTTAAGTT ATTGGAACTC
AGTGAAGAGT TTTACTATCC CGATCTCCAG CTAGGAGTGG ATTATCAGAT TGTCGATAGC
CAAACCAATG GCGAATATAT TTGGCTAAGA CTCAAACGCC TTGGTGGCAG CGAAGCCTTA
AGTGAAATGC TCGGCAATTT AATCCGCGGC TATAAACTGC GTTACAAGGT AGATGTCAAT
GATGTGATAG TGAGTGCAAC AGGACTCGGA TTTGAACGCC ATTATTTACC CCACCTGCCT
CACTTACCTT TGTATTTCAA TACTCAAACC CAAGGTTTGA GTCACATGCT ACTCAGCCGT
GACAACCAAC AGATAGTGCA TTATTTTCAA GATGAGAATG ATGTCAGCCA GTTACCAGCC
ATGTTGACGC CAACCCGAGT ATCGGCACTA CTGAGTCATC CAGAGAATCC CGATCACGGG
CTATTTTTTA GTTTTACCTA TAACGCCCAA GGCTGCTTAT ATTTTTACTC TGCCACCTTA
GCCGAGCTTA AAGCTAAGGG CATGATGCCT TTATTTTTAG GCTTTGCGTC GACCAAACCT
AGCTGGCGGA TCTTTAAAGT CACCCAAGAT AAAATTTATC ATGATAAAGG CTATGGCCGC
GCAACCCTGC CGGGCGATGA GGCTAAATTC AGCCCGCTGG TCGAACAGCA GTTATCGCAG
TTTAGTCATT TACTGCAGCT CATTGATATC AGCAATGAAG ATGCCAGAGG GCATTACAAA
GCTTGGCAAG ACAGCAGTAA TGCCAATGCC CTTAAAACCT TTGGTCAACA ACGATTAACC
ACCAATCAAA TTAAGTTAGT TTCCATGCAA TTTAGCGAGC GACGCCAAGA GGCTCGTTTC
GCCTTTAAAA CCTTAGTCAA CGTGACGCAA GACAAGCTAA AAGCAACGGG CATTACCTTA
GATATTTCTA GCCGCGGCAT GCAATTAACC TTAGATAATC CGACAGATTT CTCATCGAAT
AAACCACTAC TATTAAGCTT TCCCAAACTA CAAACTATTG CAGGCAAAAC ACAGCTAGAT
AACCTCCCCT ACCGTTTAGT GCGCACCCGC AAGAATGGCG TGACACTGCA TTTAACCGCA
GTCATGGGCC ATACCCCCCA TGTGGGGGTG GAGTTTTTAA ATAAACTCAT TGCCCATAAT
AAGGAAAAGC TAGAACAACT GACCGAGAAC AATAGTGAGG CTAATGAACT TGCCGATGGT
TTAAAAAACA TTTTGATGCA TGATCTGCAT TCTGTGCCCT ACTTTGTTGA AAAAACGACA
AAATCGGCCC AAGTCGCGTG TTTAGGCGTC GGTACGCGAC AAGATGAGAT CAGCGATATT
TTTGCTGCGG GCACCTCAGA TACCCTGCAA TATAATTTAG CGCCGCTGCT AAAAGACGGT
TTCTTTAAAC GGGATATTCT CGAACCTATT CGCCAGATGA AACCCCAGCA AGATATGGAT
TTTATTGAAG TGTTTGTTCA ATTGATCCGT CAATCACGTG GCAAAATCTT TTTAAAGTGT
ACTCCCGCCA CGGAAGTCGG TGAAGTTGAT GCTCAGATAA CCTTTATCAA TCAGAGCAAA
GCAGTAGGCC GATTCCTCGC CCTACGCATT TACCGTGGCG CAACAGAGAA GCCAGATATG
AGTTACCTGC GCCGCGAGTT GGAATACATT AATATTCATG CAAACCATAA AGCCAAACAG
TTAGAAGAAC AACTATGGCG AGTGATTGGC GTCGGTGAAC TCTTGGATAT CACCCAAGAG
GTTGAATTAA GATACCCAGT GCTCTACAAA AAACAGTCTT AA
 
Protein sequence
MSLDNHSALI EQLKPLLMEP NFQEIFQQLT IDETNSTRFL VKMELSRLAS PCTRIIDLRD 
KSELPCTEVM LGQQRHFLDE PAKSSMHEAM SLFRNQYTLG VYEYVLNAHQ QRRVKLRQGV
TQIEAAEPEP FMVPGVVLGS YFNRAEERMN YSIRIAASQT GRGEVPGITA DLSIGGARIR
LAANHPFDLD QPLKVKLLEL SEEFYYPDLQ LGVDYQIVDS QTNGEYIWLR LKRLGGSEAL
SEMLGNLIRG YKLRYKVDVN DVIVSATGLG FERHYLPHLP HLPLYFNTQT QGLSHMLLSR
DNQQIVHYFQ DENDVSQLPA MLTPTRVSAL LSHPENPDHG LFFSFTYNAQ GCLYFYSATL
AELKAKGMMP LFLGFASTKP SWRIFKVTQD KIYHDKGYGR ATLPGDEAKF SPLVEQQLSQ
FSHLLQLIDI SNEDARGHYK AWQDSSNANA LKTFGQQRLT TNQIKLVSMQ FSERRQEARF
AFKTLVNVTQ DKLKATGITL DISSRGMQLT LDNPTDFSSN KPLLLSFPKL QTIAGKTQLD
NLPYRLVRTR KNGVTLHLTA VMGHTPHVGV EFLNKLIAHN KEKLEQLTEN NSEANELADG
LKNILMHDLH SVPYFVEKTT KSAQVACLGV GTRQDEISDI FAAGTSDTLQ YNLAPLLKDG
FFKRDILEPI RQMKPQQDMD FIEVFVQLIR QSRGKIFLKC TPATEVGEVD AQITFINQSK
AVGRFLALRI YRGATEKPDM SYLRRELEYI NIHANHKAKQ LEEQLWRVIG VGELLDITQE
VELRYPVLYK KQS