Gene Sde_3587 details


Gene Information

Locus tag: Sde_3587
Symbol:
ID: 3966449
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Saccharophagus degradans 2-40
Kingdom: Bacteria
Replicon accession: NC_007912
Strand:
Start bp: 4546841
End bp: 4547827
Gene Length: 987 bp
Protein Length: 328 aa
Translation table: 11
GC content: 52%
IMG OID: 637922684
Product: hypothetical protein
Protein accession: YP_529054
Protein GI: 90023227
COG category: [R] General function prediction only
COG ID: [COG2319] FOG: WD40 repeat
TIGRFAM ID:
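
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the gene length includes the stop codon; neither convention is stated in the record, although both agree with the listed values. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Gene Information table.
length = gene_length_bp(4546841, 4547827)
protein = expected_protein_length_aa(length)
print(length, protein)  # 987 328
```

Both results match the Gene Length and Protein Length fields, which suggests the record uses inclusive coordinates and counts the stop codon in the gene length.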


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000116276
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.305585
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTTAGAA CAACACAATG GGGGGTGAGC CTAATTGCCT GCCTGCTACT CACAGGCTGC 
AACGACTATT TGGCACCAAC AGATTCACTC GAGGTAGCGG TCAAAGGCAT ACATGCAGGC
GCCTTTAGCA GCCAAGCCGA CTACGCGATT GTAGGCTCTA TTCATCACGG CGGTAGCCTG
TGGCAAATAG ACAGCAAAGA GCGACTATAC AATTGGAATC ACAAAGCCGA TGAGGCCACA
ACTATCATCA GCGCTGACTT TTCCCCCGAC GGCAAACTCG CCATTACAGC GGACCCATAC
ACGCTGGTAA TGTGGGAAAC CACCAGCGGT GAAGCAACAC GTTACTGGAC TGCACCCGGT
GAAATTCTCG ACGCCAAACT CGGCCCTAAT GGCAGGCTTG CCCTACTCGG GCTAAGCGAC
CACAGCGCAG TGTTGTTTGA TATTCAAAAA GGCGGCATAA AGCGCACTCT GCCACACGGC
AATCGCGTGC GCAGCGTAGA CATAAGCGAG AATGGCCGCT GGGCATTAAC CGGGTCTGAA
GATTATCACG CGCGCTTTTG GGATTTATCC AATGGCAAGC AGCTGTTTGA TATTAAGCAC
GACGACGATG TACAGCTAGT AACGCTATCC AACGACGGCT CGCATGCACT ATCGGTAAGT
AAGTACGATG CCGCATTGGT ATGGGACACC AAAACAGGAG AAGTGCTTGG CAAAATCCCC
CTGCGCGCCG AAAGGTTAAA ACGCGGTTTG CAATTTACCT CCGCAGCCTT TAGCGCAAAT
AACGAATACT TGTTAACCGG TCGGCCAGAT CAAATAGTGC AATTGTGGCG CGTTGAAACC
TTAACACTTG TTAACCAGTG GGAGGTGCCT AAGCGCGATG CATGGAAACC AACAGGCGCT
TCGATTGTGG CTGTTGGCTT TGGCCAACAG GAGTGGACAT ACCTCGCCAT GAGCTCAAAC
GGCTTTATCC ACCACCTAAA ACTTTAG
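
A GC content figure such as the one reported in the Gene Information section can be recomputed from the gene sequence. The following is a minimal sketch, assuming GC content means the percentage of G and C bases over all bases; the record does not say how its value was derived or rounded. The helper name `gc_percent` is hypothetical, and the example uses only the first row of the sequence.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First 60-base row of the gene sequence, pasted with its column spacing intact.
first_row = "ATGCTTAGAA CAACACAATG GGGGGTGAGC CTAATTGCCT GCCTGCTACT CACAGGCTGC"
print(f"{gc_percent(first_row):.1f}% GC in the first 60 bases")
```

Passing the full 987-base sequence instead of the first row gives a value that can be compared with the GC content field.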
 
Protein sequence
MLRTTQWGVS LIACLLLTGC NDYLAPTDSL EVAVKGIHAG AFSSQADYAI VGSIHHGGSL 
WQIDSKERLY NWNHKADEAT TIISADFSPD GKLAITADPY TLVMWETTSG EATRYWTAPG
EILDAKLGPN GRLALLGLSD HSAVLFDIQK GGIKRTLPHG NRVRSVDISE NGRWALTGSE
DYHARFWDLS NGKQLFDIKH DDDVQLVTLS NDGSHALSVS KYDAALVWDT KTGEVLGKIP
LRAERLKRGL QFTSAAFSAN NEYLLTGRPD QIVQLWRVET LTLVNQWEVP KRDAWKPTGA
SIVAVGFGQQ EWTYLAMSSN GFIHHLKL