Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sde_4000 |
Symbol | |
ID | 3967419 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Saccharophagus degradans 2-40 |
Kingdom | Bacteria |
Replicon accession | NC_007912 |
Strand | - |
Start bp | 5034112 |
End bp | 5037030 |
Gene Length | 2919 bp |
Protein Length | 972 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 637923097 |
Product | hypothetical protein |
Protein accession | YP_529467 |
Protein GI | 90023640 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1629] Outer membrane receptor proteins, mostly Fe transport |
TIGRFAM ID | [TIGR01782] TonB-dependent receptor |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.463681 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCGCTA AAAAAACGCC TTTGGCTAGC TTTTTACAAG CTTCCTTACA AAAACTTAAT GCAACCACCA CACACGCAGC AGCCATAGGT GCTGTAACGT TATTAAGCCA AACGGCGTTT GCCGATGGCA GTGTTAGCGG TAAGTTGTCC GATAAAAGTG GCAGCTACGG TATAGAAGGT GCGTTAGTTA CAGTTAAAGA ATTGAATGTA CAAACCAGTA CTAACCGCGA TGGTCGATAC AACTTGCCTT CACTGCCCAA TGGCGAGTAC ACCCTAGAGT TAAGTTATGT GGGTGCCGAT ACTCAAACAC GCGCCATTGC CATTACCGAT AACGCCATTT TGGTGGAACA CTTTACTCTG TCTTCAAACG CTACCGAGCA CGTATTGGTA ATTGGTGAAG CGGCGAGTTT AAACAAAGCG CTTAACAAAC AGCGTGCGTC TAACAATATT GTTAGCATTG TAAATGCCGA TGCGATAGGT AATTACCCAG ATGCAAACAC CTCTGAAGCC TTGCAGCGCT TGCCCGGTGT ATCAGTAGAA AACGATCAAG GCGAAGGCCG CTACGTGCGC ATTCGCGGCT TGGGAGCCAA TTTTAACTCG GTAACCATTA ACGGTAGCAA GGTGCCCTCC CCTAATGCGG GTGAGCGCGC AGTAGCGTTA GATACCGTAC CGTCAGAGTT GTTGGAATCG CTCGAGGTAA CAAAAACCCT CACACCAGAT ATGGATGCCG ACTCGTTAGG CGGCACCATT AACGTAAAAA GCTTAAGCGC ATTCGATCGC GATGGCCGCT TTTATAAAGC AACGCTAGAA ACCAACTTTG ATGAACACAC ATCACAAGCC AGCCCCAAAT TATCGTTAAC CGCTAGCGAC CTTTTTAGTA TTGGCGATGG CGCAGATAAT TTTGGTGTTG CGGGTGCGCT TAGCTGGTAT GAACGCAAGT TCGGTTCCGA CAACGTAGAA ACAGGCGGCG CGTGGGACGT TGCCGACAAT GCCGCAGACA CCCGCCTAGA AGAATTCGAA CAAAGAGATT ACACCATTAG CCGCGAACGC CTAGGCGCAT CGCTTAATTT CGATTACCAA GCTAACGATG TAACTAACTT ATTCTTACGC ACCCTTTACT CCGAATATAC CGATGCCGAA GAGCGCCAAG CCAACGTGAT AGAAATGTAT CAGTTCGAGC GCGACGACTT GGGCGACATT GTGCTGGATG ACGAAGGCGA TGAAGAAACC AACGACGGTA TACCTGCTAG TGAAACCGGC GCAGCGACCA TTGCCAGAGA GCTAAAAGAT CGCACCGAAA CCATGAAAAT TAAATCGTTG GTGTTAGGCG GTAAAAGCGA GTGGGATACG TGGGTAATAG ATTACAAGCT GGGTGCAAGC CAATCTAGCG AGCACACGCC ATTTCATATT GATGGTGCGG TATTTGAAGC CGACTTCGAA GAAGGTATAG GTTACAGCGC GGCTAAACAA ATTTACTTGC ATGCGCCTGC CGAAGTATAT GCTGCAGAGA ACTACGAGCT AGATGAAATA GCTACCGCAC AACAAATTAC CAAAGACAAA GAAAACAATA TTCAGCTAGA TATTACTAAG CGCTTACAGT TGGCAGGCAA CCCCCTTGCA TTAAAGTGGG GGGCAAAACT AAGCCAGCGC GAAAAAACCA GTGACGAAGA AGTATGGGCA TATTCAAACC TAGACGAAGT AGGCTTTACC GATGAGCAAC TATTGTTAAG TAACTACGTA GCAACTAATA GTGGCGAAGC CCAAATAGAC TACCAGTTTG GGCCTATGGG CCAAGCTATT GCAAGTCGCC CCCTGCTAAA TGCCATTAGC CCACTGGATA AAAGCGAATA TCACGAAGAA ATAGATTCAG CCATAGCGGA TTTTACCGTT AACGAAGATA TTTCAGCCGC GTATGTAATG GGTACCCTAG ATGTAGATAA ACTGCGTATT CTTGCTGGTG TGCGTGTAGA GCAAACAGAT TTTACCGCCG CCGGTATGCA CTACAATGAA TACGAAGATG AAGGCGCTGA AATTGAAGAG CTAACGGCAG CCTCTTTTAA CAACGATTAC AGCCATACAT TACCTTCACT GCACCTACGC TATCTAATTG GCGAAAAAGT ACAATTGCGC GCAGCTTGGA CCAATGCCGT TGTGCGGCCA ACCTTCGAAC AACTAAGCCC AGCAAAAGTG CGCGAAGGCG ACGAAGTGGA ATTTGGCAAC CCACTACTTA AACCACTAGA AGCCTCGAAC CTAGATATTG GTATTGAATA CTACGGCGGT TTTGCTAGCT ACTTTTCTGC GTTTGTATTC AGTAAAGATA TAGAAAACTT TGCGTACGCC ATAGACCTAG GCGCCACTGC AGATGCAAGC CTTGTAGGCC CTGGCGAAAT ATCTGAAGCT AACACTTTTA AAAATGGCGA TAGCGCCACC CTTACAGGTT TGGAGCTAGC CGCGAGTAAA CAATTTTCGG AACTACCCGC CCCGTGGGAT GGCTTTTTAG TAAGCGCCAA CGCTACTTGG ACCAGCTCTG AAGCGACTAT CGAGTATTTA GATGACGAAC TGTTTGCGCA GCGCGACATA CACTTGCCAT CGCAATCTGA TTTTTCCGGT AACTTTGCTA TTGGTTATGA AACAGAAAAA TTCAGCTTAC GTGTAGCGGC AAACTATAAA AGTGAATATC TGTTAGAAGT AAGTGACCCC GCCGATGCAC AGGGCGATGC TTGGGTAGAT GCACAAACAG GTATCGACTT TCTAGCGCGT TGGTATGCCA GCGAAAAAGT ACAGGTATTT GTGCAGGGTG TTAACCTAGG CGACCAAGGT TACTATGTGT ATAGCGGCGA AGGCCAAAAA GATTACAACT TCCAATACGA AGAGTATGGC CCAAGTTTTA GACTAGGCGT AACAATTACT GATTTTTAA
|
Protein sequence | MRAKKTPLAS FLQASLQKLN ATTTHAAAIG AVTLLSQTAF ADGSVSGKLS DKSGSYGIEG ALVTVKELNV QTSTNRDGRY NLPSLPNGEY TLELSYVGAD TQTRAIAITD NAILVEHFTL SSNATEHVLV IGEAASLNKA LNKQRASNNI VSIVNADAIG NYPDANTSEA LQRLPGVSVE NDQGEGRYVR IRGLGANFNS VTINGSKVPS PNAGERAVAL DTVPSELLES LEVTKTLTPD MDADSLGGTI NVKSLSAFDR DGRFYKATLE TNFDEHTSQA SPKLSLTASD LFSIGDGADN FGVAGALSWY ERKFGSDNVE TGGAWDVADN AADTRLEEFE QRDYTISRER LGASLNFDYQ ANDVTNLFLR TLYSEYTDAE ERQANVIEMY QFERDDLGDI VLDDEGDEET NDGIPASETG AATIARELKD RTETMKIKSL VLGGKSEWDT WVIDYKLGAS QSSEHTPFHI DGAVFEADFE EGIGYSAAKQ IYLHAPAEVY AAENYELDEI ATAQQITKDK ENNIQLDITK RLQLAGNPLA LKWGAKLSQR EKTSDEEVWA YSNLDEVGFT DEQLLLSNYV ATNSGEAQID YQFGPMGQAI ASRPLLNAIS PLDKSEYHEE IDSAIADFTV NEDISAAYVM GTLDVDKLRI LAGVRVEQTD FTAAGMHYNE YEDEGAEIEE LTAASFNNDY SHTLPSLHLR YLIGEKVQLR AAWTNAVVRP TFEQLSPAKV REGDEVEFGN PLLKPLEASN LDIGIEYYGG FASYFSAFVF SKDIENFAYA IDLGATADAS LVGPGEISEA NTFKNGDSAT LTGLELAASK QFSELPAPWD GFLVSANATW TSSEATIEYL DDELFAQRDI HLPSQSDFSG NFAIGYETEK FSLRVAANYK SEYLLEVSDP ADAQGDAWVD AQTGIDFLAR WYASEKVQVF VQGVNLGDQG YYVYSGEGQK DYNFQYEEYG PSFRLGVTIT DF
|
| |