Gene Sde_2082 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSde_2082 
Symbol 
ID3967466 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSaccharophagus degradans 2-40 
KingdomBacteria 
Replicon accessionNC_007912 
Strand
Start bp2662284 
End bp2666021 
Gene Length3738 bp 
Protein Length1245 aa 
Translation table11 
GC content47% 
IMG OID637921172 
ProductTfp pilus assembly protein FimV-like 
Protein accessionYP_527554 
Protein GI90021727 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3170] Tfp pilus assembly protein FimV 
TIGRFAM ID[TIGR03504] FimV C-terminal domain
[TIGR03505] FimV N-terminal domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0604144 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.660247 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTCATC GTTCACTGGC TATGCTCTTC GGCGCTGCGT CATTGTTAGC ATCGCAAATG 
GCAAGTGCTC TAGGCTTGGG CGAGGTAAAT CTCAAATCCT CCCTTAATCA GCCGTTAGAT
GCTGAAATCA AACTACTTCA GATACGCGAT CTTACTAAGC AAGAGATCCT TATTGGGTTA
GCGTCTCGCG ATGATTTTGA ACGCATAGGT GTAGATCGGC CATACTTCCT ATCTGATTTA
TCCTTCGAGG TAGATATCAA TAACGCCAGC GGCCCAATCG TTCGCGTTAA GTCTACCAAA
CCAGTGCGTG AACCCTTTCT AAACTTCATT ATTCAGGCTC AGTGGCCAAG CGGTAAAGTG
CTGCGAGAAT ACACTGTGCT ATTAGATTTA CCTGTTTTTG CAGATGCGCC AGCGCAGCCT
GTTGCCGCCA CGCAAACGCA GGGTGCACAG AGGTCCGCCG AAAAGAAGCC TGTAGCGCAA
GCGCCTAAGA GTGATACGCG CTACAACCCA CGTTCTTCTT TTGATGAGGG TCGTGCCTCT
ACTAATGCCT CCGCGCCATC ACAGGCTCAA TCATCAGCAT ACGCTGGTGT AGATGTAATT
GGGCCAGTTA AGGCCAACCA AACCTTATGG GAAATTGCAG CTCAGGTGCG TCCAGACAGC
AGTGTGTCTG TTCAGCAAAC TATGTTGGCT ATTCAGCGAT TAAATCCCGA TGCGTTTATA
AATAACAATA TTAACTTGTT AAAGCGCGGT CAAATTCTAC GCGTACCTGA TGCCTCACAA
ATTAAAGAAT TCACACGCCA AGAGGCGGTG CGAAGCGTGG CACAGCAAAA TGCCGCATGG
TCTGGCAAGT CATCTGCTGG CGATGTGCAG TTGTCTGGTA GCAAAAGCTA CACCTCTAGC
AGCAGTAATA GCGACAAGGC TGAGGGTAGG CTTAAGCTTT ATTCACCAGA AGACTCGAGT
GATGCGGCAT CTGGCCGTAC CAGTGGTTCT GGCTCCAGCA GCACCGAGGC ACTTGAGAGC
GAGTTAGCTA TAACCTTAGA GCAACTTGAA AAAACTGAGC GCGACAACCA AGAAATGCGC
TCAAAAATCG AGTCGCTAGA AGAGCAGATC CAAACAATGG AGCGCTTGGT TGAGGTAAGT
AATGCTGATT TACGTGCTCT TGAGTTGGCT GCAGAGAAAA ATCTACAAGA CAAAAAAGAA
AGTGGCGAAG TTAACGAGTC TGTTACGCCG GCAGAAACTG AAGTTGCCGA GTCTGCTACC
GAAACAACCG ACATGGCTGA AAAGCCTGCA GATGTAGTAG CGCCACCTCC TGTAAAAGAA
GTTCCAAAGC CCGAAGTGGT TGAGGCCGCT AACAAGCCAG ACCCATCCAA AGTTGTACGA
ACCCAGCGCA AACCAGAGCC ATCTATTGTT GATATGCTAA TGGACAACAT TCTTTTTGTT
GCTCTTGGTG TAGTTGCAAT ACTCGGCGCA ATTGTTCTTT TTGTGCGGTC ACGTGGCAAG
AAAGACGAAT TTGAAGAGGA TGACTTCCTA GAGCAGACCA CATTTGAGGC GCCAGAGGCC
AACGAAGAAG ATTTGCTTTC TTTGGGTGAT ATAGATACTG GCGAGACGGA AAGTGAGTTT
GAGCAGCTAG AAGAGGAAGC GCCGGAAGAA GAAGTAAGCG CAGAAGCTGA GACTGGTGAT
GCAGCTGCCG AAGCAGATAT TTATATTGCT TATGGCAAGT ACGATCAAGC AGAAGAAATG
TTGGTAACGG CATTAGAAAA AGAGCCGCGC AACATTGAAG CTAGATTAAA GCTGCTTGAG
GTATATGCCT CTCAGAATGA TGTGCATAAG TTTGACCCCC ATTTCGCTGT TATATATGCC
GAAGGTGATG CTAGCTACGT TGAGCGCGGC CAGCAATTGC GCGCTGGTAT TGCAGATGCT
GGTGAATTCG ATGCAGATTT GTATGTAACT GATGTGTTGG GTGAAACATT CTCTGGTTCG
GAAGAGTCTG AGCCTTCGAA TGACGATTTA GATTTTGAGT TAAGCCTAGA TGGTGAAGAG
ACCAGTACAG AGGTAGAAGA AGCGACAACT TCTGAGGCTG ATCTTGATTT TGATCTGGGC
TCGATGGAAG AGCCTACTGA AGCGCAGCAA GAAGAATCGT TAGACTTCGA TTTGGATCTT
GGTGGCTTGG AATCAACTGA TTCTGAAGAT GGCGCTACAG AGTTCGAGCT AGACGTTTCT
GACGTAGATG ATTCTCTGAG TCTTGATTTG GATATAGACG ATATAAATGA ATCTAGTGGT
GATGAGCTTG CTGATTTAGA TGATTTGGAT TTCGATCTAA GCTTGGATGA TGTTGAAGGT
GCTGCCGATA AACCAAGCGA AGATGATTTC TCGTTAGATT TCGATTTAGA TGATGTTAGC
GAAAGTGAAA CACAAATCAC GCCTGCAGTT TCCGATGCTG ACGCCAAGGG CGAGGTGGAT
GACGGCGAAT TTGATTTGAG TGAAGAGTTT TTAAGCTTGG ATCAAGGGGG CGACGATACG
GAAGCTTCTA GCGAAGATGA GCTAATTGCT GACAATTTAG AGGAAGACTT AGATTCCATC
GATTTCGATC TTGGTATCGA AGACTTTAAT GTTGAAAGCG TTGATAAGCC TGCAGAAGAG
GCCGGTGGTG TTGAAGCTGG TTCGCTCGAA GATGATTTGG CTGCTTTAGA TTTGGATTTG
GAGTCTTTGG AGTTAGATTC TGGCGAAGAG CAAACCGAGC TGTCTATTGA AGCGGACGAT
TTTGATCTAG GTACTTTGGA TGATGACGCT TCATTAGGCG CTGACTCTGA AGTCTCCAGT
GAAGCAGACG ATCTTAACGT TGATGATTTG AGCCTTGAGG GGCTGGATGA GTTAGACGAT
TTGGATTTGT CATTAGATGA TGCTGCAGAC ACTACACTCG AAGATATTGA TGACTCGTTG
TCGTTAGAGG GTGATGACGG TGAATTTGAT CTTTCGTTAG AAGACGGTTT GCAGCCGGCT
AGTGAAGAAA GTGCTGAAGC GGATATCGAA TTTGATTTAG GTGCAGATGG CTTAGATTTA
GAAAACTTAG ATGTAGATGA ACCTGCAGTA GAGCAGGCGT CAGTTGACAC TGAGCTGGAT
GTAGCAGACA GCGATGACCT CGGAGAGGTG GATGAGCTAG AGGGGGATCT TGATCTTTCT
GCGCTCGACG ACGAGCTTGA TGCTTTAACC GGTGATTTAG ACTTAGATGA CCTAGAGGCA
GACTTTTCGG AAGAAGCATT GGCTGCAGAG GTGGCAAGCG AAGATGCTGC AGAAGCGCAG
CCTTCAACGC TAGAGATGGA AGAGCCAGTT ACTGACTTTG GTGACTTGGA TGGGCTTGAT
ACCTTAGGGG ATGATTTAGA ACTGGAGTCT GAATTGGATG TGCCAGAGCT TGACGTACCT
GAGTTAGGTT CTGGGTTGGA GCCAGAGCAA GCCACAGAAG AAATGGGTGA TGACACCCTT
TTTACAAAGG CTATTTCCGA TATTCCAGAT GAAGATCTAG ATTTTGAAAT TCCTGAAATC
GACCCAGATT CTATGGATGA TGACTCTGAT CTTGGCTTCT TAAGTGATAG TGACGAGACG
GCTACCAAGC TTGACTTGGC GCGCGCATAC ATCGATATGG GTGACGCTGA AGGTGCTCGC
GATATTATCG AAGAGATCAA GAAAGAGGGG AACGATCAGC AGAAGGAAGA GGCTGATAAG
CTTCTATCCC GTATATAG
 
Protein sequence
MGHRSLAMLF GAASLLASQM ASALGLGEVN LKSSLNQPLD AEIKLLQIRD LTKQEILIGL 
ASRDDFERIG VDRPYFLSDL SFEVDINNAS GPIVRVKSTK PVREPFLNFI IQAQWPSGKV
LREYTVLLDL PVFADAPAQP VAATQTQGAQ RSAEKKPVAQ APKSDTRYNP RSSFDEGRAS
TNASAPSQAQ SSAYAGVDVI GPVKANQTLW EIAAQVRPDS SVSVQQTMLA IQRLNPDAFI
NNNINLLKRG QILRVPDASQ IKEFTRQEAV RSVAQQNAAW SGKSSAGDVQ LSGSKSYTSS
SSNSDKAEGR LKLYSPEDSS DAASGRTSGS GSSSTEALES ELAITLEQLE KTERDNQEMR
SKIESLEEQI QTMERLVEVS NADLRALELA AEKNLQDKKE SGEVNESVTP AETEVAESAT
ETTDMAEKPA DVVAPPPVKE VPKPEVVEAA NKPDPSKVVR TQRKPEPSIV DMLMDNILFV
ALGVVAILGA IVLFVRSRGK KDEFEEDDFL EQTTFEAPEA NEEDLLSLGD IDTGETESEF
EQLEEEAPEE EVSAEAETGD AAAEADIYIA YGKYDQAEEM LVTALEKEPR NIEARLKLLE
VYASQNDVHK FDPHFAVIYA EGDASYVERG QQLRAGIADA GEFDADLYVT DVLGETFSGS
EESEPSNDDL DFELSLDGEE TSTEVEEATT SEADLDFDLG SMEEPTEAQQ EESLDFDLDL
GGLESTDSED GATEFELDVS DVDDSLSLDL DIDDINESSG DELADLDDLD FDLSLDDVEG
AADKPSEDDF SLDFDLDDVS ESETQITPAV SDADAKGEVD DGEFDLSEEF LSLDQGGDDT
EASSEDELIA DNLEEDLDSI DFDLGIEDFN VESVDKPAEE AGGVEAGSLE DDLAALDLDL
ESLELDSGEE QTELSIEADD FDLGTLDDDA SLGADSEVSS EADDLNVDDL SLEGLDELDD
LDLSLDDAAD TTLEDIDDSL SLEGDDGEFD LSLEDGLQPA SEESAEADIE FDLGADGLDL
ENLDVDEPAV EQASVDTELD VADSDDLGEV DELEGDLDLS ALDDELDALT GDLDLDDLEA
DFSEEALAAE VASEDAAEAQ PSTLEMEEPV TDFGDLDGLD TLGDDLELES ELDVPELDVP
ELGSGLEPEQ ATEEMGDDTL FTKAISDIPD EDLDFEIPEI DPDSMDDDSD LGFLSDSDET
ATKLDLARAY IDMGDAEGAR DIIEEIKKEG NDQQKEEADK LLSRI