Gene Sde_2991 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSde_2991 
Symbol 
ID3967752 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSaccharophagus degradans 2-40 
KingdomBacteria 
Replicon accessionNC_007912 
Strand
Start bp3801424 
End bp3804465 
Gene Length3042 bp 
Protein Length1013 aa 
Translation table11 
GC content45% 
IMG OID637922088 
Producthypothetical protein 
Protein accessionYP_528460 
Protein GI90022633 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID[TIGR01782] TonB-dependent receptor 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000283322 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000000325565 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCGCAATA ACAATAAAGA GGCTAGACAA GTGAATACAA CCGAGAGAAC CCCTAGTAAC 
GGACGCTTTA GAAAGTCGCT TCTAGTAAGT GCTATCGCTG CTGTGTCCTG TATTAACGTT
GCGCATGCAC AGGAGTCGGA AGGTCAGCTT GAAGAAGTTG TTGTTACCGG CACGCGCGCA
ACAATCCAAT CTACGATCGA TATTAAACGC AATTCCACCA CTATTGTTGA TGGTTTATCT
GCTACCGATA TTGGTGATAT GCCAGCGCTT TCTATTGGTG AGGCGTTAGA GAATATTACA
GGTGCTGCTT CTCACCGTGA GAACGGCGGC GCAACCGAAA TTTCTATTCG TGGTCTTGGC
CCATACTTGA GCGCCACTAC CTTTAACGGC CGCGGTGCTA CCAATGGTAG TGGTGACCGT
TCTGTTAACT TCTCTCAGTT CCCATCTGAA TTAGTAAACA AAATTGCAAT TTACAAAACC
CAAGACGCCA GCCTAATTGA AGGTGGTGTT GCTGGTTTGA TTGCATTAGA AACTGTTAAG
CCTTTGGAAT ACGGTAAGCA ACGTATCCAG GTTGATGCCA AATTAAACTT CAACCCTAAT
CAAGCAAATA TTGACGACCC AATGAACGGT AACTTGGGCT TCCGTGGCAC GGTAAGCTAT
ATTGATCAGT TCGAAATTGG TGATGCCGCT TTGGGTGTGT CTTTGGGTTA CCAGAAGAGT
GATATTTCTC AGCCAGAATC TGAGATGCGT TCTTCTAGCC CGAGTGGTTC ATCTTTGTAT
GCTTGTTTGT ATGATCCAGA TATTAGTGCT GGTTTTTATC GCACGAGCAG CGGCGATTGC
AATTCTGGTG GCACCTATAA CACTACGAAA GATCCTGAAA CGGGTAAGGC TGTAAATGAT
GGCAGCCCTT ACGGCTTTGC CCCTAGCTCT AATGGCTACC GTCAAAATGA TACGGCTGAT
GAGCGTGATG CGGTTTTTGC TGCGATTCAG TTCCAGCCTA CAGATAAGCT AGATATTAAT
TTAGATGTGC AGCAGTCGAA GCGTGTTCAA GCTGAGCAGC GTCACGACTT AAACTTCGCT
AACATGAGGC GTGTTACGCC AGGCATCACT GATCAGGCTG TTGTAATTAG CAGTACTGGT
GCCGTTACTA GTTGGGCTGG CGAAACTGCT ATCGAATCTA ACAGCGAGAC ATACTCTCGT
ACTGAAGAGT ATTTGGGTTA TGGTTTGAAT GTTGCTTACG AGCTAACCGA CGACATTACT
ATTTCTGGTG ATTACTCTTT CTCTGAAACG ACTCGTGAAG AAGTGCAGTA TTTATTGCGT
ACTCAATCGA ACAACGGTGA TGTGAATGGT GACTCATCTT CTTACCGTCC TTTGGTTGCT
TGGGATATGG ATTCAGGTAT TCTCCAATAT GCTGTGTTAG ATTTTGACGT TACCGATCAT
GCCGTATTCT CAGATGAATA TCGTGTACGA ATAGATAATG ATGTTGACCG AACTAACACT
GCTACCGCAT TCCGAGCGGA TATTGATTGG CGTGTGAGCA CAGATTTTAT TACATCTGTT
AAGGGTGGTA TACGATACTC AGAGCTAGAA TACCTAGACT TGGGTAGCGC GCGTAATGAA
TTCGAAGTTG ATCGCAATAA CGTTAACCCG GAGACGAAAG CGTTAGATGT TGCAGCGATA
TCTGCGGTGA ATACAGCATG TGCTATCGAT TTCCCAGAAA GTAACTTCCT AAGTAGCGAG
CGAAGCGGCG ATTTAGTTAC TCAACTTGCT AGTGACGGCA GCGTGGTTGG TAGTTCGAAT
AGTTGGGCGA CGTTTGACAC TGTATGTGCC GCTCAAATGA TTGCAAATTA TCGCGGTGAA
AGTTTGGAGT ACCCTGAGCT AGAAGATGGT ACCTCTCAAG TTACAGACGT TACCGAAATG
ACTACTGCTG CGTACGTAAT GGCGGACTTT GAAACTTCAC TTGCTAACAC TCCTGTTCGC
GGTAACTTTG GTGTACGTGT AGTGCAAACA GAAGTTGAAT CTGTGGGTTA CCGTACAGCA
TTTAACGTTG TAACAAACGA TTCTGGCACT TTGAAGTTAG AGCGTACCGG AGATACAGAG
CAAGTAACTG CAGGTGGCGA TTACACTGAG TTGTTGCCTA GCGTAAACGT GATTGCAGAC
CTTACTGATA ACTTGATGTT ACGTGGTGCT GTCTATCGTG GTCTATCACG CCCAGATCCT
GCTGATCTTG GTTACAAGCG TGATTTCGAT GAGAACACCG AAGACGATAT TACCGACGTA
AATCAGTTAA TCGATAGTGT TGATGGCGAC GGCAATCCCA ATATGCAGCC GCTTACTTCG
TGGAACTTTG ACGCCGCTAT CGAGTGGTAT GCGAACGACG ATACTATGTT AGGTTTCGGT
GTTTACCATA AGAAATTCCA GGGTGGTTTC GAAACCATTC GCACTACTGA GAGTTTCTTA
ATAGACGGTG TTAGCACTCA AGCAGAATTT GATCTTGTTA GCACGAACGA AGAAACGAGT
AACTTAACTG GTTTTGAGCT TAACGCTTCA CACAACTTCT CTTACTTGCC TGGCTACTGG
AGTGGCTTGG GTGTGAAGGG TAGCTTCAAC CACGCGATCT CAGATTTCGA GTTCGAAGAT
AGCAACTACG GTGATATCAC TAAGACGGAT GAGGATGGCA ATATCGTTGA GGAATACATT
GGTATAGTGG AGCCTGGTAA CGTACCAGGC TTCTCTGAGA ATGTATTCTC TGGTCAGATC
TACTACCAAA TTGGTGAGTT AGACACGAGC TTAATCTATA AGTACCGCAG CGAGTACTTC
CAGCCATACA CAAGCAATGG TACGCGTTTG CGTTATGTTA GCGATGTGGG TGTGTGGGAA
GCGCGTGTTT CTTACAAGCT AACCAAAAAC GTGAAGTTAA GCTTAGAAGC TATCAACTTG
TTTGATGAGC CTAAAAAGCA ATACTTCTAC AGCCGCGACA ACTTAGGCGA GCTAAACAGC
TACGGCCCAC GTATATTCGT AGGTGTTAAA GCTAAGTTTT AA
 
Protein sequence
MRNNNKEARQ VNTTERTPSN GRFRKSLLVS AIAAVSCINV AHAQESEGQL EEVVVTGTRA 
TIQSTIDIKR NSTTIVDGLS ATDIGDMPAL SIGEALENIT GAASHRENGG ATEISIRGLG
PYLSATTFNG RGATNGSGDR SVNFSQFPSE LVNKIAIYKT QDASLIEGGV AGLIALETVK
PLEYGKQRIQ VDAKLNFNPN QANIDDPMNG NLGFRGTVSY IDQFEIGDAA LGVSLGYQKS
DISQPESEMR SSSPSGSSLY ACLYDPDISA GFYRTSSGDC NSGGTYNTTK DPETGKAVND
GSPYGFAPSS NGYRQNDTAD ERDAVFAAIQ FQPTDKLDIN LDVQQSKRVQ AEQRHDLNFA
NMRRVTPGIT DQAVVISSTG AVTSWAGETA IESNSETYSR TEEYLGYGLN VAYELTDDIT
ISGDYSFSET TREEVQYLLR TQSNNGDVNG DSSSYRPLVA WDMDSGILQY AVLDFDVTDH
AVFSDEYRVR IDNDVDRTNT ATAFRADIDW RVSTDFITSV KGGIRYSELE YLDLGSARNE
FEVDRNNVNP ETKALDVAAI SAVNTACAID FPESNFLSSE RSGDLVTQLA SDGSVVGSSN
SWATFDTVCA AQMIANYRGE SLEYPELEDG TSQVTDVTEM TTAAYVMADF ETSLANTPVR
GNFGVRVVQT EVESVGYRTA FNVVTNDSGT LKLERTGDTE QVTAGGDYTE LLPSVNVIAD
LTDNLMLRGA VYRGLSRPDP ADLGYKRDFD ENTEDDITDV NQLIDSVDGD GNPNMQPLTS
WNFDAAIEWY ANDDTMLGFG VYHKKFQGGF ETIRTTESFL IDGVSTQAEF DLVSTNEETS
NLTGFELNAS HNFSYLPGYW SGLGVKGSFN HAISDFEFED SNYGDITKTD EDGNIVEEYI
GIVEPGNVPG FSENVFSGQI YYQIGELDTS LIYKYRSEYF QPYTSNGTRL RYVSDVGVWE
ARVSYKLTKN VKLSLEAINL FDEPKKQYFY SRDNLGELNS YGPRIFVGVK AKF