Gene Sde_3200 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSde_3200 
Symbol 
ID3965673 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSaccharophagus degradans 2-40 
KingdomBacteria 
Replicon accessionNC_007912 
Strand
Start bp4076735 
End bp4078285 
Gene Length1551 bp 
Protein Length516 aa 
Translation table11 
GC content51% 
IMG OID637922297 
Productdystroglycan-type cadherin-like 
Protein accessionYP_528669 
Protein GI90022842 
COG category[J] Translation, ribosomal structure and biogenesis
[K] Transcription
[L] Replication, recombination and repair 
COG ID[COG0513] Superfamily II DNA and RNA helicases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones38 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCTGCAA TCCTGTATCC TGCGCGCCCT TTAGTTGATG CAGACCCCAA TATGAAATTT 
GCCGAACTTG GCCTTTGTCC CGCCATTCAG AAAGCCGTAC TCGAACAAGG CTACGAAACC
CCTACCCCAA TTCAAGCGCA AGCCATTCCC CCTGTATTAG AAGGGCGCGA TGTGATGGCC
GCTGCGCAAA CCGGTACGGG TAAGACGGCG GGGTTTACAT TGCCCATTTT AGAAATACTG
GCAGAAGGCA TAGAAAACGG CCGTAAAGTA AAGCCGAATC AGGCACGTGC GTTAGTACTT
ACTCCTACGC GAGAGCTTGC AGCACAGGTA GGTGAAAACG TTGCCCTATA CGGCAAGTAC
TTGCCTATTA AATCTACCAT CGTATTTGGC GGCGTGAAAA TTAACCCGCA AATGATGAAA
TTGCGCGGCG GTGTCGACAT TTTAGTGGCC ACGCCAGGCC GTTTGATGGA CCTATACAAT
CAGCGTGCAG TGAAGTTTGA TCAGCTAGAA ATGCTGGTGT TGGACGAAGC CGATCGCATG
CTCGATATGG GCTTTATTCA CGATATTCGC AAAATTATGG CTATTTTACC GAAGAAACGT
CAAAACCTGA TGTTTTCGGC GACGTTCTCG CAAGATATTC GCGAATTAGC TAAAAGTATT
GTGAATAACC CAGTAGAAAT TACTGTGAAC CCGCCCAACA GCACCGCAAC CCGCGTAAAA
CAGTGGATTT GCCCCGTTGA TAAAAAAGAA AAGCCCGCTT TGCTTACCCA TTTGATTAAA
ACCAATAAGT GGCAGCAAGT GTTGGTGTTC TCTCGCACCA AGCATGGCGC AAATAAATTA
GTTAAACAAT TGGAAGGCAG TGGCCTGCGA GCAGCGGCGA TTCACGGCAA CAAAAGCCAA
GGCGCACGCA CTAAAGCGTT AGCCGAGTTT AAAAATGGCA CGGTAAAAAT TCTTGTAGCC
ACCGATATTG CCGCTCGCGG TTTGGATATT GATCAACTAC CGCAAGTGGT GAACTTCGAC
TTACCTCAGG TTGCTGAAGA TTACGTGCAT CGAATTGGCC GCACAGGCCG TGCTGGCGCA
GAGGGCAATG CCGTTTCGCT AGTGAGTGCC GACGAATTTC AAATGTTAAA AGAGATTGAG
CGTTTAACTA AAACGTTGCT CACTCGCGAA GTTATTCAAG GCTTTGAGCC AGACCACAAT
TTACCTGAGT CTCGATTAGA TACCCGCCCC ATTCGCCCCA ATAAGCCGAA ACGGCCCAAG
CCAGCAGGTG GTGCTTCGAA CCGCAGTGGC GGTGGTAACA GTGGCGGTGG CAATAATCGC
GCTAAGCCGC GTGGCGATGA CAGCAAAGCG GCCAATGCCG ATACCCCGTG GGCAGGCAAA
GCAAAGCCGC GCAAGCCTCG CCCTGCAGGT GCTAAACCCG CTACACAAGG TAAGCCTGCC
GGCGCGCGCA ATGGCAATAC AGGCCCAAAA GGTAAGGGTG GGCCAGCCGG TGGGGGTGCC
AGACGACCAA CTAATAAGCC TAGCGGTGCA CAAGGGGCTG CTAGAAGTTA G
 
Protein sequence
MAAILYPARP LVDADPNMKF AELGLCPAIQ KAVLEQGYET PTPIQAQAIP PVLEGRDVMA 
AAQTGTGKTA GFTLPILEIL AEGIENGRKV KPNQARALVL TPTRELAAQV GENVALYGKY
LPIKSTIVFG GVKINPQMMK LRGGVDILVA TPGRLMDLYN QRAVKFDQLE MLVLDEADRM
LDMGFIHDIR KIMAILPKKR QNLMFSATFS QDIRELAKSI VNNPVEITVN PPNSTATRVK
QWICPVDKKE KPALLTHLIK TNKWQQVLVF SRTKHGANKL VKQLEGSGLR AAAIHGNKSQ
GARTKALAEF KNGTVKILVA TDIAARGLDI DQLPQVVNFD LPQVAEDYVH RIGRTGRAGA
EGNAVSLVSA DEFQMLKEIE RLTKTLLTRE VIQGFEPDHN LPESRLDTRP IRPNKPKRPK
PAGGASNRSG GGNSGGGNNR AKPRGDDSKA ANADTPWAGK AKPRKPRPAG AKPATQGKPA
GARNGNTGPK GKGGPAGGGA RRPTNKPSGA QGAARS