Gene Sbal195_3386 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbal195_3386 
Symbol 
ID5755190 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella baltica OS195 
KingdomBacteria 
Replicon accessionNC_009997 
Strand
Start bp4008123 
End bp4009439 
Gene Length1317 bp 
Protein Length438 aa 
Translation table11 
GC content45% 
IMG OID641289719 
Producthypothetical protein 
Protein accessionYP_001555808 
Protein GI160876492 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG4968] Tfp pilus assembly protein PilE 
TIGRFAM ID[TIGR02532] prepilin-type N-terminal cleavage/methylation domain 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00409988 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAACGCTC AAATATTACG AAAAAACAAT AATCGAGGCT TCAACTTGTT AGAGATAATG 
GTTGTTGTAG CGATTATTGG AATACTCGCG GTTGTGGCGG TTCCTTTGTA TAAAGATTAC
ATAATACGTG CGCAGGTTAC AGAAGCTTTT GTTTTTGCCG ATGCGGAAAG AATTAAAGTC
ATTGAAAAAC GTATTGAAAG CACCAACGTT GATATTGCAA CATTCTCTGA ACCAAAAGTA
CATATGACTT CTTTGATGTG GGTGCCTGTA ATAAATAACC AACCAGTCGA AAATTCGGTT
ATCGGATATA TTCTTCCCAC TATGGATTTA AAAGGTCTTG GGGTGAGAGA TACGTTTGCG
CTTGAGTATT TTTACAATGG CTCTTGGCGT TGCGTTAATG CTGCCAATGC GATTGGTCGC
GATGCAGTGT CTACCAATAA AGCGCTTGAT GACAAGTACC TCCCAAGTAG CTGTCGTACC
GGAGCCGGCT TGTTGGCGGC ACATCCTAAG GCTCCTGCGG GCTGCCCCCC TGGGACTCAA
AAAGCACAAG TGAAAGATGC TAGCGGGAAA CAGCAGCAAG TCTGCCAGCG ACCTAAGCCT
CAGGTACAAC CGCAGGCACA ACCTCAGGCA CAACCGCAGG CTCAACCTCA GGCACAACCT
CAGGCACAAC CTCAGGCACA ACCTCAGGTG AAACCTCAGG CTGTGGTTAA GCCCCCCTCC
TGTCCTATAG GCCAAGACTG TAGCCATAAG GATCCTAAAT GTTCAGTAGC GGGACAAGAA
CACATAGATC AAACCTCGGT TCCTGTACGT GATGCATACA ACCCGTTTCC AGCGGTTGGG
GCGGGGATTT ACACCACTAA GAGCCAAGCT GTGCCGACGG GTTGTGTAGC TAAGTGTAAA
CCCGGTTTTG TTTTTAATCC CAATGAACCT TCAAAGTGCT CAATAGCCCC CTCAACAAAT
AATCATACTT GTCGAGGCCC CAAATTTATT TGTGAGCGAA GCCATGTCAC TACAGGGGCC
GCGTGTACAG TTGACGCGCC TTATGCTGCC AATTTTATTG AAAATTTAAA AGATGGTTCT
CGTTACGTGA CAAGAGGCTG TGTTACACAG CAAGAGGCAT TTGCAGCTGA CAAGTACAAT
AAAGGCAATG ACAATTGTAA AAATTATAAT GTTGTAGTGC TACAAGACGC TCACTTTAAA
TGTACATTCG CTTGCTATGG TGATGCTTGT AATCTTGAAT CTGTTCCAGA TCATCCAGCA
ACATGGGCCG ATGGTAAATC GTCAACGGAT TTACCTGATC AGTTTAATAC TCCTTAA
 
Protein sequence
MNAQILRKNN NRGFNLLEIM VVVAIIGILA VVAVPLYKDY IIRAQVTEAF VFADAERIKV 
IEKRIESTNV DIATFSEPKV HMTSLMWVPV INNQPVENSV IGYILPTMDL KGLGVRDTFA
LEYFYNGSWR CVNAANAIGR DAVSTNKALD DKYLPSSCRT GAGLLAAHPK APAGCPPGTQ
KAQVKDASGK QQQVCQRPKP QVQPQAQPQA QPQAQPQAQP QAQPQAQPQV KPQAVVKPPS
CPIGQDCSHK DPKCSVAGQE HIDQTSVPVR DAYNPFPAVG AGIYTTKSQA VPTGCVAKCK
PGFVFNPNEP SKCSIAPSTN NHTCRGPKFI CERSHVTTGA ACTVDAPYAA NFIENLKDGS
RYVTRGCVTQ QEAFAADKYN KGNDNCKNYN VVVLQDAHFK CTFACYGDAC NLESVPDHPA
TWADGKSSTD LPDQFNTP