Gene Sde_4000 details

Gene Information

Locus tag: Sde_4000
Symbol:
ID: 3967419
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Saccharophagus degradans 2-40
Kingdom: Bacteria
Replicon accession: NC_007912
Strand:
Start bp: 5034112
End bp: 5037030
Gene Length: 2919 bp
Protein Length: 972 aa
Translation table: 11
GC content: 47%
IMG OID: 637923097
Product: hypothetical protein
Protein accession: YP_529467
Protein GI: 90023640
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1629] Outer membrane receptor proteins, mostly Fe transport
TIGRFAM ID: [TIGR01782] TonB-dependent receptor
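
The coordinates and lengths in the record above are mutually consistent, and a short check makes the arithmetic explicit. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends with one untranslated stop codon. Both are assumptions, although they agree with the reported lengths. The variable names are illustrative.

```python
# Values copied from the gene record.
start_bp = 5034112
end_bp = 5037030

# Assumption: coordinates are inclusive at both ends, so add 1.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 2919 0 972
```

A remainder of 0 means the gene length is a whole number of codons, as expected for an unspliced CDS.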


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.463681
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCGCTA AAAAAACGCC TTTGGCTAGC TTTTTACAAG CTTCCTTACA AAAACTTAAT 
GCAACCACCA CACACGCAGC AGCCATAGGT GCTGTAACGT TATTAAGCCA AACGGCGTTT
GCCGATGGCA GTGTTAGCGG TAAGTTGTCC GATAAAAGTG GCAGCTACGG TATAGAAGGT
GCGTTAGTTA CAGTTAAAGA ATTGAATGTA CAAACCAGTA CTAACCGCGA TGGTCGATAC
AACTTGCCTT CACTGCCCAA TGGCGAGTAC ACCCTAGAGT TAAGTTATGT GGGTGCCGAT
ACTCAAACAC GCGCCATTGC CATTACCGAT AACGCCATTT TGGTGGAACA CTTTACTCTG
TCTTCAAACG CTACCGAGCA CGTATTGGTA ATTGGTGAAG CGGCGAGTTT AAACAAAGCG
CTTAACAAAC AGCGTGCGTC TAACAATATT GTTAGCATTG TAAATGCCGA TGCGATAGGT
AATTACCCAG ATGCAAACAC CTCTGAAGCC TTGCAGCGCT TGCCCGGTGT ATCAGTAGAA
AACGATCAAG GCGAAGGCCG CTACGTGCGC ATTCGCGGCT TGGGAGCCAA TTTTAACTCG
GTAACCATTA ACGGTAGCAA GGTGCCCTCC CCTAATGCGG GTGAGCGCGC AGTAGCGTTA
GATACCGTAC CGTCAGAGTT GTTGGAATCG CTCGAGGTAA CAAAAACCCT CACACCAGAT
ATGGATGCCG ACTCGTTAGG CGGCACCATT AACGTAAAAA GCTTAAGCGC ATTCGATCGC
GATGGCCGCT TTTATAAAGC AACGCTAGAA ACCAACTTTG ATGAACACAC ATCACAAGCC
AGCCCCAAAT TATCGTTAAC CGCTAGCGAC CTTTTTAGTA TTGGCGATGG CGCAGATAAT
TTTGGTGTTG CGGGTGCGCT TAGCTGGTAT GAACGCAAGT TCGGTTCCGA CAACGTAGAA
ACAGGCGGCG CGTGGGACGT TGCCGACAAT GCCGCAGACA CCCGCCTAGA AGAATTCGAA
CAAAGAGATT ACACCATTAG CCGCGAACGC CTAGGCGCAT CGCTTAATTT CGATTACCAA
GCTAACGATG TAACTAACTT ATTCTTACGC ACCCTTTACT CCGAATATAC CGATGCCGAA
GAGCGCCAAG CCAACGTGAT AGAAATGTAT CAGTTCGAGC GCGACGACTT GGGCGACATT
GTGCTGGATG ACGAAGGCGA TGAAGAAACC AACGACGGTA TACCTGCTAG TGAAACCGGC
GCAGCGACCA TTGCCAGAGA GCTAAAAGAT CGCACCGAAA CCATGAAAAT TAAATCGTTG
GTGTTAGGCG GTAAAAGCGA GTGGGATACG TGGGTAATAG ATTACAAGCT GGGTGCAAGC
CAATCTAGCG AGCACACGCC ATTTCATATT GATGGTGCGG TATTTGAAGC CGACTTCGAA
GAAGGTATAG GTTACAGCGC GGCTAAACAA ATTTACTTGC ATGCGCCTGC CGAAGTATAT
GCTGCAGAGA ACTACGAGCT AGATGAAATA GCTACCGCAC AACAAATTAC CAAAGACAAA
GAAAACAATA TTCAGCTAGA TATTACTAAG CGCTTACAGT TGGCAGGCAA CCCCCTTGCA
TTAAAGTGGG GGGCAAAACT AAGCCAGCGC GAAAAAACCA GTGACGAAGA AGTATGGGCA
TATTCAAACC TAGACGAAGT AGGCTTTACC GATGAGCAAC TATTGTTAAG TAACTACGTA
GCAACTAATA GTGGCGAAGC CCAAATAGAC TACCAGTTTG GGCCTATGGG CCAAGCTATT
GCAAGTCGCC CCCTGCTAAA TGCCATTAGC CCACTGGATA AAAGCGAATA TCACGAAGAA
ATAGATTCAG CCATAGCGGA TTTTACCGTT AACGAAGATA TTTCAGCCGC GTATGTAATG
GGTACCCTAG ATGTAGATAA ACTGCGTATT CTTGCTGGTG TGCGTGTAGA GCAAACAGAT
TTTACCGCCG CCGGTATGCA CTACAATGAA TACGAAGATG AAGGCGCTGA AATTGAAGAG
CTAACGGCAG CCTCTTTTAA CAACGATTAC AGCCATACAT TACCTTCACT GCACCTACGC
TATCTAATTG GCGAAAAAGT ACAATTGCGC GCAGCTTGGA CCAATGCCGT TGTGCGGCCA
ACCTTCGAAC AACTAAGCCC AGCAAAAGTG CGCGAAGGCG ACGAAGTGGA ATTTGGCAAC
CCACTACTTA AACCACTAGA AGCCTCGAAC CTAGATATTG GTATTGAATA CTACGGCGGT
TTTGCTAGCT ACTTTTCTGC GTTTGTATTC AGTAAAGATA TAGAAAACTT TGCGTACGCC
ATAGACCTAG GCGCCACTGC AGATGCAAGC CTTGTAGGCC CTGGCGAAAT ATCTGAAGCT
AACACTTTTA AAAATGGCGA TAGCGCCACC CTTACAGGTT TGGAGCTAGC CGCGAGTAAA
CAATTTTCGG AACTACCCGC CCCGTGGGAT GGCTTTTTAG TAAGCGCCAA CGCTACTTGG
ACCAGCTCTG AAGCGACTAT CGAGTATTTA GATGACGAAC TGTTTGCGCA GCGCGACATA
CACTTGCCAT CGCAATCTGA TTTTTCCGGT AACTTTGCTA TTGGTTATGA AACAGAAAAA
TTCAGCTTAC GTGTAGCGGC AAACTATAAA AGTGAATATC TGTTAGAAGT AAGTGACCCC
GCCGATGCAC AGGGCGATGC TTGGGTAGAT GCACAAACAG GTATCGACTT TCTAGCGCGT
TGGTATGCCA GCGAAAAAGT ACAGGTATTT GTGCAGGGTG TTAACCTAGG CGACCAAGGT
TACTATGTGT ATAGCGGCGA AGGCCAAAAA GATTACAACT TCCAATACGA AGAGTATGGC
CCAAGTTTTA GACTAGGCGT AACAATTACT GATTTTTAA
 
Protein sequence
MRAKKTPLAS FLQASLQKLN ATTTHAAAIG AVTLLSQTAF ADGSVSGKLS DKSGSYGIEG 
ALVTVKELNV QTSTNRDGRY NLPSLPNGEY TLELSYVGAD TQTRAIAITD NAILVEHFTL
SSNATEHVLV IGEAASLNKA LNKQRASNNI VSIVNADAIG NYPDANTSEA LQRLPGVSVE
NDQGEGRYVR IRGLGANFNS VTINGSKVPS PNAGERAVAL DTVPSELLES LEVTKTLTPD
MDADSLGGTI NVKSLSAFDR DGRFYKATLE TNFDEHTSQA SPKLSLTASD LFSIGDGADN
FGVAGALSWY ERKFGSDNVE TGGAWDVADN AADTRLEEFE QRDYTISRER LGASLNFDYQ
ANDVTNLFLR TLYSEYTDAE ERQANVIEMY QFERDDLGDI VLDDEGDEET NDGIPASETG
AATIARELKD RTETMKIKSL VLGGKSEWDT WVIDYKLGAS QSSEHTPFHI DGAVFEADFE
EGIGYSAAKQ IYLHAPAEVY AAENYELDEI ATAQQITKDK ENNIQLDITK RLQLAGNPLA
LKWGAKLSQR EKTSDEEVWA YSNLDEVGFT DEQLLLSNYV ATNSGEAQID YQFGPMGQAI
ASRPLLNAIS PLDKSEYHEE IDSAIADFTV NEDISAAYVM GTLDVDKLRI LAGVRVEQTD
FTAAGMHYNE YEDEGAEIEE LTAASFNNDY SHTLPSLHLR YLIGEKVQLR AAWTNAVVRP
TFEQLSPAKV REGDEVEFGN PLLKPLEASN LDIGIEYYGG FASYFSAFVF SKDIENFAYA
IDLGATADAS LVGPGEISEA NTFKNGDSAT LTGLELAASK QFSELPAPWD GFLVSANATW
TSSEATIEYL DDELFAQRDI HLPSQSDFSG NFAIGYETEK FSLRVAANYK SEYLLEVSDP
ADAQGDAWVD AQTGIDFLAR WYASEKVQVF VQGVNLGDQG YYVYSGEGQK DYNFQYEEYG
PSFRLGVTIT DF
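
The relationship between the gene sequence and the protein sequence can be sketched with a small translation routine. This is an illustrative example, not the tool that produced the record. It assumes the codon-to-amino-acid assignments of translation table 11, which are the same as those of the standard genetic code (table 11 differs in which codons may act as start codons, and that does not matter here because this gene begins with ATG). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0, stopping at the first stop codon."""
    # Drop the spaces and line breaks used to group the sequence in blocks of 10.
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence and the final 9 bases, copied from the record.
print(translate("ATGCGCGCTA AAAAAACGCC TTTGGCTAGC"))  # prints: MRAKKTPLAS
print(translate("GATTTTTAA"))                          # prints: DF
```

The two outputs match the first ten and last two residues of the protein sequence listed above. Passing the full gene sequence to `translate` in the same way would be the natural next check.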