Gene Sde_3402 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSde_3402 
Symbol 
ID3966128 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSaccharophagus degradans 2-40 
KingdomBacteria 
Replicon accessionNC_007912 
Strand
Start bp4337666 
End bp4338961 
Gene Length1296 bp 
Protein Length431 aa 
Translation table11 
GC content46% 
IMG OID637922499 
Producthypothetical protein 
Protein accessionYP_528869 
Protein GI90023042 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0810] Periplasmic protein TonB, links inner and outer membranes 
TIGRFAM ID[TIGR01352] TonB family C-terminal domain 


Plasmid Coverage information

Num covering plasmid clones38 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.904792 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATGTGT TGAAGTTAAA GGGTAGTTAC ATTCCGTTTA TATTGGTGGC GTTGTCTATT 
TGGGGGGCAT CTGCCTCTAC TCATGCTAAT CAAGTAAATG GGGTGGCGGC TTACCATGCA
TTGAGTGTTG AGCAATTTTT GGCTGTGTTG TATACGAGCT CGCCCACCAT TGAAGCCGCG
CCGTTACTCG AGAAAGACAC GCCTGCTCGA ATGGAGATTC GCGTTACTAC AAAGCGATTA
TCGCAGCGCA GATTTGTCAC TATGTGGCTA GAGAGCATGA CGGTAAGCAA TACATCTGAC
GTGGTAGAAG CACAATTAGA AAGCCTAGCC CAGTTTAATA AAATGTTTAA AGGCCGCTTT
ATTGATGGCG ACCGAATTGT GTTTGATTAC CGCCCAGATA GAGGCATGGA AGTAAGCGTT
AATGGCGTGG CATTAGGGGT AATTAAATCG GGTGACTTTT TCCGTTTATT GCTCGCATGT
TGGGTTGGTG ACGTGCCTAT TTCATCTAAT TTAAAGGCTA GCTTACTCAG CCCTGCAGTT
ATAGATAAAA CCTTGTTGGC ACGCTTTAAT GCAGTTACGC CAAGTGCCGA TCGCCGAGAA
GAAATTGTCG CGTGGACAAA AGTACCTGCT AAGCCAGAGC CGGTAGAGCA AGTACAGCCC
AAAGCCGAAG TGGTAAAAGC TGCACTGGTT AAAGCCCCTG TGCAAACACC TGTTGTTCCT
AAAAAGGCGC CTGCGCCGGT TGAGCGGCCA AAGCCTGTGG CAAAAGTGGC GGACAAACCT
GTTGCGTCCC AAATGGTAGA GGCTGTAAAA CAAGAGGAGA AAAAGCAGCT AGAGCCGAAG
CAAGTTGAGC CAAAAGCTGT AGAACCACCT AAACAAGTTG CAGAGCCGCC CAAACAAGTT
GTTGCTGCGT CTAAAGTTGT AGCCCCAGTT ACAACTGCAG AGACGGTAGT AGAGTTAGAG
TCCGACGATG AAACAATGGA AGGCTTGGAT GCGGGCGGTT TATTGGTTAG ACAAAAATAT
TACGATCAAC TATCTAAGCA CTTGATTCAG CAGCAATCTA TTCCACGCCA AGCGTTTCAG
CGGCGCTTAG AGGACGAAGT GCGTGTCTAT TTAACAATAA ACCGCAATGG CACTGTGATG
GCGGCTGAGC TAGAAACAGA ATCGAAATAT AAAATGTTTA ATCAGCAAGC GTTAGAGGCA
ATTGAAAAGG CCGGTGTGTT TCCCGCTATG CCAGAGGAAA TTAGTGGCGA TACCTTCTCC
TTTTCGGTGT TGTTAAATTA TCGCTTACCT ATTTAA
 
Protein sequence
MNVLKLKGSY IPFILVALSI WGASASTHAN QVNGVAAYHA LSVEQFLAVL YTSSPTIEAA 
PLLEKDTPAR MEIRVTTKRL SQRRFVTMWL ESMTVSNTSD VVEAQLESLA QFNKMFKGRF
IDGDRIVFDY RPDRGMEVSV NGVALGVIKS GDFFRLLLAC WVGDVPISSN LKASLLSPAV
IDKTLLARFN AVTPSADRRE EIVAWTKVPA KPEPVEQVQP KAEVVKAALV KAPVQTPVVP
KKAPAPVERP KPVAKVADKP VASQMVEAVK QEEKKQLEPK QVEPKAVEPP KQVAEPPKQV
VAASKVVAPV TTAETVVELE SDDETMEGLD AGGLLVRQKY YDQLSKHLIQ QQSIPRQAFQ
RRLEDEVRVY LTINRNGTVM AAELETESKY KMFNQQALEA IEKAGVFPAM PEEISGDTFS
FSVLLNYRLP I