Gene Sde_4006 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSde_4006 
Symbol 
ID3967425 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSaccharophagus degradans 2-40 
KingdomBacteria 
Replicon accessionNC_007912 
Strand
Start bp5042840 
End bp5045794 
Gene Length2955 bp 
Protein Length984 aa 
Translation table11 
GC content46% 
IMG OID637923103 
ProductTonB-dependent receptor 
Protein accessionYP_529473 
Protein GI90023646 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID[TIGR01782] TonB-dependent receptor 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000010552 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.168301 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTCGAT CAATAACCTT TAAACGCAGA TTGTTGCCTA TATGCATTTC TGCTGTGGCA 
TTTAGTGGTA TTACATCTAG CACTATTGCC CAAGAAGAGA ATTACGAAGA AGTAATTGTT
ACCGGTATTC GTGGCGCATT AACCAGAGCC ATTGATGTTA AGCGCGATGC TGGGTCTGTT
GTGGATGCTA TTAGCGCGGA AGATATTGGT AAATTACCCG ATGCCACCAT TGCGGACTCG
CTCCAGCGTG TAACAGGTAT ACAAATTCGT CGCAGTGCTG GTGAAGGCTC CACTGTAAAC
GTTCGCGGTA TGCCGCAGGT TAATACCTTA CTTAACGGCG AGCAATTTTT AAGTGCCGGT
TCTATTACAA CTTTGCAGCC AAACTTTACA GATATTCCCG CAGAGCTTTT ATCGCGTGTG
GATGTTATTA AATCGTCAGA GGCTTCTACT TTATCTGGCG GTGTTGCAGG CACAATCGAT
CTTCGTACGC AGCGACCATT AGATCTTGCA GAAGGTTGGA CTTTTGTAGG TGCCGGTGAG
CTTTCCGATG GTTCTTACAC TGATGATAAC GGCACCAAAC TTTCTGGTTT TGCCGGTTAT
CATTCAGATG ATTTTGGTGC GTTATTAAGT GTATCTACTT CTAGTGCTAC CCTCGCAAAC
TTCCGCTACG GTATGTATAA CGATTGGTGG TTCCGCGGCT ACCAAGAAGA TGGTAACTGG
CCGGGCTGGT CTACACCCAC AGATGTAACC GGCGATGGAG ACACTAACGA CGCGATATTC
GGCACAATTG ATTATGGTGT AACTAATCGC ACATCTGAGC GAGATAGAAC AGGTATTTCA
GCAACTGTGC AATATCGTGT GAACGACAAA GTAGAAGTAT TGGGCGATGT TTTCTATACA
TCTATGGATC AATACGAGCA CACCAATGGC TTAGTAGCAG ATAACGCTTG GGCGCAATAT
GATTGGGTAT ACCCACAAAA CCCCGTTAAT CGTGGCCCGA GTGCAGACGG CAACACAGAT
AAAGATTTTT ATACTGCGTC TGTATTCGAT CTGCATGCAT TGCGCGTAAC AGCGAAAGCA
GAAAGTTTTG TAGATAAAAG AGAATCTACC AATATTAACC TGCAAACAAA TATCGATTTT
ACCGACAGCT TCCGAGCAAG TGTGCGATAT ATTCATGGCT CGGCAGAAAA TAAACACACA
GGTAATTTTG CCGATGCGTT TATTACTACC GGTGAACAGC ACGGCCTGCA AACGCGTGTA
GATAACGTAA CAGAAACCGT TAACCCTAAT GGTGAAGGCC CAGATCGCAT TGTCATACGT
GGGGATATGT CTGGTACGCA CCCTTCGTTT ACTTACCCAG AAGGTTTTGG CGATAGCATC
GAAAAATACG GCTTGGTGTC ATCGTTCTCT CATCAAAATC GCGATGAAGA ATCAAAATTA
GATGTGTTGC GCTTTGACGG CATTTTAGAT CTAAACGATA ACAACTCACT CGAGTTTGGC
TATCGTTACG GCAAGCGAGA AGTCACTCGC TACCAGTACG ATTACGTTGC GCCTTTTACT
CGTCGCGGCA TGGATGACGA GCAAATTACA GTGTATTCAA AATGGAAGGA TTCTGGTTTA
CCGGTAAATG GTGACCCAGG TGCAGGTGTG TTTGGTGACA CCATTGCTCG CACAATTCCG
TTTACAGAGC TAGATGCAAT GGGGTGGATT ACCGAAGTAA GCGATTTTGG CCCAGCACCT
AGTGACGGTC GCAGCTTTTA CTTTATTGAC CCTAAAGCCA TGGACGACGC ACTTGGCTTT
CACAATACGC TCTACCCTGG CAATGTTGCT ATTAAAGACC CAGGTAGAAG CTACGAGCTA
GACGACAAAA CGCATACCCT TTATGCACAG GCCAATTTTG AAGGCGAGTT TGGTGTACCT
TATCAAGCGA ACTTTGGTGT GCAGTACATT CGCACGTTTT TGGATGTAAC GACCAATGTA
CCGGGTGTGG AACCTATCGT TGAGGTTGAT GGCGTTGAGT ACCCTACTTT AAGCGGAACA
CCGCCACAGG ATTTAGGTGA CTCTACCGTA GAGCGTAGTT TTACCGACTT TCTACCGCGT
TTTAATATCG GCTTTGATAC TAGCGAAAAT ACAAAATTGC GTTTGGCTTA CACCAAAACT
ATGACTCAAT TAGATGCCAA TGATTTAGGC TTGGGTTTAG TTTACACCGT TAACAACAAC
GCCGACCTTG GCGTATTCCA AGCTGTAAGC GCCTCGCAAG ATGGCAACCC CTATATGGAG
CCATGGCGTG CAGAAAACTA CGACGCAACC TTCGAGTGGT ATTTTGCTGA ATCGAGTATG
GCAAGTATTG GTTTATATCG CTTAGATGTT GCCACTTCTA TAACCACAAC AGGTACGACT
ACAGCCGCAG TGCCAGATTC CGATGGCGTA ATTCGCGATG AAGATGGCGA GATAGGATTA
ACCATTCGCG ACAATACCGA CGGTGGCGTT GTGCAAGGTA TAGAGCTTGG CTATCAACAG
GCGTTCGATT TTTTGCCTGG TGCATTTAGC GGTTTAGGTA CCACGTTAAA CTACACATGG
GCAGATGGTG AAGGTGGCGA TAAAGACTTC TACGGCGCAA CCATGCCAAT GGGAGATAAC
TCCGAGCACC AGTTTAATGC AATATTGTGG TACGAAATGG ATGGGTGGCA AGCGCGTGTT
GCAATGAACT ATCGCAGCGA ACGTTATATT GGTCGCGCGT GGAACGATGG CCACCCAGCA
GCTTGGTGGT CTGCACCAAC CACGTATGTT GATGCATCGG TAAGTTACGA TATTACCGAT
GGTATAACCG TATTCCTACA GGGAACCAAT ATTACCGAAG AATACGAAGA AACCTATATG
CAGTGGCAGG ATGTAGTGGT AAACCAGAAC GTGTTTGAAG CACGTTACAA CCTAGGTGTG
CGAGCTAGAT TCTAA
 
Protein sequence
MSRSITFKRR LLPICISAVA FSGITSSTIA QEENYEEVIV TGIRGALTRA IDVKRDAGSV 
VDAISAEDIG KLPDATIADS LQRVTGIQIR RSAGEGSTVN VRGMPQVNTL LNGEQFLSAG
SITTLQPNFT DIPAELLSRV DVIKSSEAST LSGGVAGTID LRTQRPLDLA EGWTFVGAGE
LSDGSYTDDN GTKLSGFAGY HSDDFGALLS VSTSSATLAN FRYGMYNDWW FRGYQEDGNW
PGWSTPTDVT GDGDTNDAIF GTIDYGVTNR TSERDRTGIS ATVQYRVNDK VEVLGDVFYT
SMDQYEHTNG LVADNAWAQY DWVYPQNPVN RGPSADGNTD KDFYTASVFD LHALRVTAKA
ESFVDKREST NINLQTNIDF TDSFRASVRY IHGSAENKHT GNFADAFITT GEQHGLQTRV
DNVTETVNPN GEGPDRIVIR GDMSGTHPSF TYPEGFGDSI EKYGLVSSFS HQNRDEESKL
DVLRFDGILD LNDNNSLEFG YRYGKREVTR YQYDYVAPFT RRGMDDEQIT VYSKWKDSGL
PVNGDPGAGV FGDTIARTIP FTELDAMGWI TEVSDFGPAP SDGRSFYFID PKAMDDALGF
HNTLYPGNVA IKDPGRSYEL DDKTHTLYAQ ANFEGEFGVP YQANFGVQYI RTFLDVTTNV
PGVEPIVEVD GVEYPTLSGT PPQDLGDSTV ERSFTDFLPR FNIGFDTSEN TKLRLAYTKT
MTQLDANDLG LGLVYTVNNN ADLGVFQAVS ASQDGNPYME PWRAENYDAT FEWYFAESSM
ASIGLYRLDV ATSITTTGTT TAAVPDSDGV IRDEDGEIGL TIRDNTDGGV VQGIELGYQQ
AFDFLPGAFS GLGTTLNYTW ADGEGGDKDF YGATMPMGDN SEHQFNAILW YEMDGWQARV
AMNYRSERYI GRAWNDGHPA AWWSAPTTYV DASVSYDITD GITVFLQGTN ITEEYEETYM
QWQDVVVNQN VFEARYNLGV RARF