Gene Sde_3841 details


Gene Information

Locus tag: Sde_3841
Symbol:
ID: 3966990
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Saccharophagus degradans 2-40
Kingdom: Bacteria
Replicon accession: NC_007912
Strand:
Start bp: 4840682
End bp: 4841932
Gene Length: 1251 bp
Protein Length: 416 aa
Translation table: 11
GC content: 47%
IMG OID: 637922938
Product: hypothetical protein
Protein accession: YP_529308
Protein GI: 90023481
COG category: [S] Function unknown
COG ID: [COG3146] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
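
The coordinate and length fields above are mutually consistent, which can be checked with a small sketch. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count of a CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return gene_length_bp // 3 - 1


gene_len = feature_length(4840682, 4841932)
print(gene_len)                  # 1251
print(protein_length(gene_len))  # 416
```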


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value: 0.614644
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAATATA AATTTATACC CAGTATTCAC CACATAGATG CGCAAAGCTG GGACAGCTTA 
TTTGCACAGA GCAACGGCTA CCCGTTTGTA CAGCATGCGT TTTTAGCCAC ACTAGAAAAT
ACCGACTGCA CCAATAAAGC CAGCGGCTGG CAGCCGCACC ACCTAACGGT TACCAATAGC
AGCGGCCAAT TAGTAGCAGC ACTGGTATTA TTTTTAAAAA ATCACTCCTA CGGCGAATAT
GTATTCGACT GGAGCTGGGC CGACGCCTAC GCACAAAACG GCTTAGATTA TTACCCCAAA
TTCGTTAGCG CCATTCCCTT CACCCCCGCC ACTGGGCCAA GGTTTGGTTT TGCCGAAACC
CTAAGCGCAG AAGAAAAAAA CAATTTAGCC AAAGCGATGG CCGACCACCT ACAAGCGTTT
AGCCAAGAGA AAAACATAAG TGGCGTGCAT ATTTTGTTTC CATCAATGGA ACAATTTAAT
TTACCGCTCA ACACAACCAA CAACGTTGAA GAACCATTAG AAAGCCAATG GCAGCAACGC
ATAGGCTGCC AATACCAATG GTTTAACCAA GGTTACAGTA GCTTCGAACA ATTTTTAGAA
TCTTTTAGTT CACGCAAACG CAAAAATGTA AAAAAAGAAC GCGCAAAAGT AAGCCAGCAG
AATATTACGC TAACCATGCG CAGTGCCGAT GCTGTGCCAT TAGAAGAGTG GGAAACATTC
TACGCCCTTT ACCATCACAC CTACTTAAAA CGCAGTGGCC GCTACGGCTA CCTCACTAAA
GCATTTTTTT TATCGCTGGC AAAAATATTT CCAACACAAG TACTACTTTG CCAAGCGTGG
CGCGACCATG AAATGATTGC CGCTGCACTT TATTTTCGCG ACAACACAAC ACTGTATGGC
CGCTACTGGG GCGCAATGGA AGAACTAGAC GGCCTGCACT TTGAAGCCTG CTACTACCAA
GGCATAGAGT ACGCCATTGC CAACGGCCTA CAACGCTTCG ACCCTGGCGC ACAAGGCGAA
CACAAAATTC AACGCGGTTT TGTACCCGTA AAAACCAGCT CGCTACACTG GCTAGCCCAC
CCGGGCTTTC AACACGGCGT GGCTCAGTTT TTACAAAGCG AGACGCAACA CATGCAGGGT
TTTATGAAAG AGGCGAGAGA GTTGCTGCCG TTTAAGGAAG GGATTGAGCT GCCTGCGGAG
GGGTGTTTGT TGGGAATAAG AGAGGAAGTT AAACAGAAAG CCTTCGAATA G
 
Protein sequence
MQYKFIPSIH HIDAQSWDSL FAQSNGYPFV QHAFLATLEN TDCTNKASGW QPHHLTVTNS 
SGQLVAALVL FLKNHSYGEY VFDWSWADAY AQNGLDYYPK FVSAIPFTPA TGPRFGFAET
LSAEEKNNLA KAMADHLQAF SQEKNISGVH ILFPSMEQFN LPLNTTNNVE EPLESQWQQR
IGCQYQWFNQ GYSSFEQFLE SFSSRKRKNV KKERAKVSQQ NITLTMRSAD AVPLEEWETF
YALYHHTYLK RSGRYGYLTK AFFLSLAKIF PTQVLLCQAW RDHEMIAAAL YFRDNTTLYG
RYWGAMEELD GLHFEACYYQ GIEYAIANGL QRFDPGAQGE HKIQRGFVPV KTSSLHWLAH
PGFQHGVAQF LQSETQHMQG FMKEARELLP FKEGIELPAE GCLLGIREEV KQKAFE