Gene Sde_2203 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSde_2203 
Symbol 
ID3965973 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSaccharophagus degradans 2-40 
KingdomBacteria 
Replicon accessionNC_007912 
Strand
Start bp2803220 
End bp2804818 
Gene Length1599 bp 
Protein Length532 aa 
Translation table11 
GC content45% 
IMG OID637921293 
Productflagellin-like protein 
Protein accessionYP_527675 
Protein GI90021848 
COG category[N] Cell motility 
COG ID[COG1344] Flagellin and related hook-associated proteins 
TIGRFAM ID[TIGR02550] flagellar hook-associated protein 3 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGTATAT CTTCGCTGCA AATATTTAAT ATCGCGAACA GTGGCATGGC TGATGCTAAT 
CAGGCCATGG TGAAGACGCA AGAGCAGCTC TCTACCGGCA AGCGTGTTTC AACGCCCTCG
GATGATCCCG TAGCTTCCAC CAAAATTATT CAATTAACTG AAGAGTTGGC CAATATAAAT
CAATATGGCA TTAATAGTGA CCTAGCCGAA AATAGCATTG TGTTACAAGA GTCTGCGCTT
GAAGCGATAA ATAACTTAAT TGTCAGAATG CAGGAACTAG CCGTACAGGC GGGTAACAAC
GCAACATTGG GGCCTACCGA GTACAAGGCG CTTGCTGCCG AAGTAGACAG TCGTTTAGAA
GAGTTACAGA GCTTGCTAAA TACTAAAGGG CCTGGCGGTG ACTATATTTT TGGCGGTTAT
CGTTCAACCA CAGAGCCATT TAGCGGAACT GCAAGTACTG GCTTTAGTTA TAATGGTGAT
GAAGGGCAAA AATTTGTAAA AATAGCGAAC AATACTCATA TTGCCGCTAG TGATTCTGGC
AAAAAAATAT TCGTTGATAT ACCAAGCGCA GAAAATACAG TGCGTACTTA CCCTAGTTCT
AATAATCGCT CTTCTCCGCC AGTAGAGGTG TCTGTTGGTC AAGTATTTGA TCAAGAGGCT
TACGACGAAT TCTATCCTGA AGATATTGTT ATTTCGTTTA ACCATGAAAG CACGGTGGTG
CCGGCAGGAA AAAACTACAC GGCAACCGAG CGCTCGACCG GCAAAGTTAT TGTTGCTAAT
CAGCCGTTTC GCTCTGGTGA ATTAATTGAG TTAAAGGGCG TACGCTTTGC TGTAGTGGGT
GAACCAGCCT CTGGAACTGC AGCGCAAGCA GCTTTACTTA ATTTTGATGC AGACGGTGTC
GACGCAAGAC CCATTGATTT TACTGCAACC CCAGAGACAT TCGACATAAC TGTTCGAGGA
CGAAAAGAAA CACTGGTATT GGACGGCCCT ATTAACAGCG CTGCAGACCT ATCAACTGTA
TTAAATTCTG CGGGTAACGG TAATGCTGCT GCCCTAGCTA GATTGGGCGT GGCGGTAGAT
GCTACGGGGT TTAATATGCC AGCAGGGCTC CAAATTACGC TATCGAATGG TTCTGCAAAC
ACTAACGCAG CGCTCGGTGT TACATCCGCT ACTCAGAGTG TATCCTCTGG CGGTGTGACA
GCTGTTGCCG GTGACCGAAT ATTTATTGAA GCAACCAATA AACAAGATGT ACTTACAACT
CTGGCGCAAA TGAGTGAATC GATGAAGACC TACGAGGGTG GCACCGAAGA TAGACGGTCT
ATTGAAAAGG TGATTGCTTC CACACTTAAG AATCTATCTA ACGCACAAAC ATCTATATCG
AATGTCACTT CTGAGCTTGG TGCCAGATAT AATACAATTG ATAGTACACG ATCACTTCAT
TTAGACTCGG AAGTTGTTAT AAATAAATTT CTAGCAGACT TAAGAGATGT AGACTACGCC
GAAGCAGCTA CGCGTTTAAG TGTCGAAACC CTTGTGTTAC AAGCTGCGCA GTCTTCTTTT
GTGAGGGTTT CTCAGCTCAC GCTATTCTCG CAGTTATAG
 
Protein sequence
MRISSLQIFN IANSGMADAN QAMVKTQEQL STGKRVSTPS DDPVASTKII QLTEELANIN 
QYGINSDLAE NSIVLQESAL EAINNLIVRM QELAVQAGNN ATLGPTEYKA LAAEVDSRLE
ELQSLLNTKG PGGDYIFGGY RSTTEPFSGT ASTGFSYNGD EGQKFVKIAN NTHIAASDSG
KKIFVDIPSA ENTVRTYPSS NNRSSPPVEV SVGQVFDQEA YDEFYPEDIV ISFNHESTVV
PAGKNYTATE RSTGKVIVAN QPFRSGELIE LKGVRFAVVG EPASGTAAQA ALLNFDADGV
DARPIDFTAT PETFDITVRG RKETLVLDGP INSAADLSTV LNSAGNGNAA ALARLGVAVD
ATGFNMPAGL QITLSNGSAN TNAALGVTSA TQSVSSGGVT AVAGDRIFIE ATNKQDVLTT
LAQMSESMKT YEGGTEDRRS IEKVIASTLK NLSNAQTSIS NVTSELGARY NTIDSTRSLH
LDSEVVINKF LADLRDVDYA EAATRLSVET LVLQAAQSSF VRVSQLTLFS QL