Gene Sde_3541 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSde_3541 
Symbol 
ID3966383 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSaccharophagus degradans 2-40 
KingdomBacteria 
Replicon accessionNC_007912 
Strand
Start bp4498131 
End bp4500449 
Gene Length2319 bp 
Protein Length772 aa 
Translation table11 
GC content50% 
IMG OID637922638 
Productgeneral secretion pathway protein J 
Protein accessionYP_529008 
Protein GI90023181 
COG category[K] Transcription 
COG ID[COG2183] Transcriptional accessory protein 
TIGRFAM ID[TIGR00426] competence protein ComEA helix-hairpin-helix repeat region 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.489396 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGCAAA TTTACAATCG TATTGCCGAT GAACTCAATG TACAGCAGCG ACAGGTTGAG 
GCCGCGGTGG CCCTGCTTGA TGAAGGCTCT ACCGTGCCTT TTATTTCGCG TTACCGCAAA
GAGGTTACCG GCGGGCTAGA CGATACTCAG CTGCGTAACT TAGAAGAGCG CTTAACCTAC
CTGCGCGAGA TGGAAGACCG CCGCGACACC ATTCTTAAAT CCATCGCTGA GCAAGAAAAA
CTCACCCCCG AGTTAGAGCA GCAAATTAAA GGTGCCGAGA CAAAAACACA GTTAGAAGAT
TTATACCTGC CCTACAAACC CAAGCGTCGC ACCAAAGCGC AAATTGCTCG CGAAGCAGGG
TTAGAGCCAT TAGCCGATGC TCTGTTGGCC AACCCAAGCC TGGTGCCAGA AACCGAAGCG
CAAGCCTACT TTAACGAAGA GCATAAAATT ACCGATATTA AGTCGGCGCT TGATGGCGCT
AAGCAGATTT TGATGGAGCG TTTTAGTGAA GATGCCGTTC TGCTTAACAA GATGCGTCAA
TTCCTAAAGC AAGAAGGTTA TATAAGTGCT GCCGTGGTTG AGGGCAAAGA GGCTGAGGGC
GCGAAATTCC AAGATTATTT TGAGCACAAA GAGCCGTTGG CCAATACCCC TTCGCACCGC
GCACTGGCTA TGTTCCGCGG CCGCAATGAA GGCGTGCTTA CGATGGTGCT TACCTTAGAT
AATGAGCTAG AGCCAGGTCA GCGTCACCCG TGTGAGACTA TGGTTGCCTC TCACTGGCAA
ATAGAAGACC AAGGCCGCCC TGCGGATGCA TGGCTTGCCG AAGTGGTGCG TTGGACTTGG
CGAGTAAAAT TAAGCACGCA CTTAGAAACA GACTTATTTA GTGAATTGCG CGAGCGTGCA
GAGGCGGATG CTATTGATGT ATTTGCCCAC AACCTAAAAG ACTTGTTGCT TGCAGCTCCT
GCAGGGCAAA AAGTGACTAT TGGTTTAGAC CCTGGTTTGC GCACGGGTGT AAAACTGGCG
GTTGTTGGCG CTACCGGTGA AATATTAGAT CACGGTGCTA TATTCCCTAC ACCTCCGCAA
AACCGCATCG CCGAAGCAGA AGCTGTGATT GTGGCTTTAT GTAAAAAACA TAACGTGGGC
TTAATTGCGA TTGGTAACGG TACTGCTAGC CGCGAGACCG ATAAATTTGT GGGGGATTTA
CTTAAAAAGC ACAAAGATAT TAAAGCGCAA AAAGTAATGG TGAACGAGGC GGGTGCGTCG
GTTTACTCTG CATCTGAGTT TGCTGCGAAA GAATTCCCAG ATTTGGATGT TACGATTCGC
GGCGCTATTT CCATTGCTCG TCGTCTGCAA GACCCGCTAG CCGAGCTTGT GAAAATTGAC
CCTAAATCGA TTGGTGTGGG CCAGTACCAG CACGATGTAT CACAAAGCCG TTTGGCTAAG
TCGCTAGATG CCGTAGTAGA AGATTGTGTG AACGGTGTGG GGGTGGAAAT CAACACCGCT
TCGGCGCCAC TATTGGCGCG GGTGTCGGGC CTAAGCTCTT CTATCGCCAA CAATATTGTG
GAGTTCCGCC ACAAAAACGG TGGTTTTAAA AGCCGTGAGC AACTCAAAGA AGTAAGTCGC
TTGGGGGCCA AAGCGTTTGA GCAGGCTGCC GGCTTCTTGC GTATTGCTGG GGCAGAAAAC
CCATTGGATG CCTCTGGTGT TCACCCCGAG TCGTATACGG TGGTAGAGCA AATCGCCGCG
AAAAACCAAC GCGAATTGCG CGGTTTAATT GGCGATGCGG GCTTTTTACG CAGTCTAAAA
CCTGCAGAGT ACACCAACGA GCAATTTGGT TTACCTACGG TGACCGATAT TATTGCCGAG
CTAGAAAAAC CCGGTCGCGA TCCGCGCCCA GAGTTTAAAA CGGCGCAATT CCAAGATGGT
GTTGAAACCA TAAAAGATTT GGAGCCGGGT ATGATTTTGG AAGGCACGGT AACTAACGTG
ACTAACTTCG GCGCGTTTGT CGATGTAGGT GTGCATCAAG ATGGTTTGGT ACATATTTCT
GCACTTTCAA ACACTTTTGT TAAAGACCCC CGCGAAGTGG TAAAAGCCGG CGACATTGTG
AAAGTTAAAG TGATGGAAGT GGATGTGCCG CGCAAGCGAA TCGCCATGTC TATGCGTATG
GACGATGAGC CAGGTGAAAA GGTAACAGGT CGCCCAGCTG GCGGAGCCGG TGGTCAGGGC
GCGAAACAGG CGCGTCGTCA CAACACCAAA CAGCAAGCTG CTCAGCCTTC AGGCACCATG
GCAGCACTGT TTCAGCAGGC GCTTAACAAG AAAAAGTAA
 
Protein sequence
MQQIYNRIAD ELNVQQRQVE AAVALLDEGS TVPFISRYRK EVTGGLDDTQ LRNLEERLTY 
LREMEDRRDT ILKSIAEQEK LTPELEQQIK GAETKTQLED LYLPYKPKRR TKAQIAREAG
LEPLADALLA NPSLVPETEA QAYFNEEHKI TDIKSALDGA KQILMERFSE DAVLLNKMRQ
FLKQEGYISA AVVEGKEAEG AKFQDYFEHK EPLANTPSHR ALAMFRGRNE GVLTMVLTLD
NELEPGQRHP CETMVASHWQ IEDQGRPADA WLAEVVRWTW RVKLSTHLET DLFSELRERA
EADAIDVFAH NLKDLLLAAP AGQKVTIGLD PGLRTGVKLA VVGATGEILD HGAIFPTPPQ
NRIAEAEAVI VALCKKHNVG LIAIGNGTAS RETDKFVGDL LKKHKDIKAQ KVMVNEAGAS
VYSASEFAAK EFPDLDVTIR GAISIARRLQ DPLAELVKID PKSIGVGQYQ HDVSQSRLAK
SLDAVVEDCV NGVGVEINTA SAPLLARVSG LSSSIANNIV EFRHKNGGFK SREQLKEVSR
LGAKAFEQAA GFLRIAGAEN PLDASGVHPE SYTVVEQIAA KNQRELRGLI GDAGFLRSLK
PAEYTNEQFG LPTVTDIIAE LEKPGRDPRP EFKTAQFQDG VETIKDLEPG MILEGTVTNV
TNFGAFVDVG VHQDGLVHIS ALSNTFVKDP REVVKAGDIV KVKVMEVDVP RKRIAMSMRM
DDEPGEKVTG RPAGGAGGQG AKQARRHNTK QQAAQPSGTM AALFQQALNK KK