Gene Sde_3947 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSde_3947 
Symbol 
ID3967212 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSaccharophagus degradans 2-40 
KingdomBacteria 
Replicon accessionNC_007912 
Strand
Start bp4974930 
End bp4976186 
Gene Length1257 bp 
Protein Length418 aa 
Translation table11 
GC content48% 
IMG OID637923044 
Producthypothetical protein 
Protein accessionYP_529414 
Protein GI90023587 
COG category[S] Function unknown 
COG ID[COG4289] Uncharacterized protein conserved in bacteria 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.174675 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAGAC GCGATTTTTT AAACGCCGCT GCATTGGGCG GTGCCGCATA TATGACGGGT 
TTACCGCAAT TTGCCAATGC TAAAACGCTT ACTACCGGTC AACAGGATCA CACCTACTTT
GCAAATTTAT TAGAGAAAAT TTGTTCGCCT ATTTTGCATT TAATGGCTAA CGAACAATTC
CATGCAAAGT TCCCATTAGA GGTAGGAGCC AACAGTGATG GACGCGACCA TCGCGTTGCC
TACCTTGAAT GTTTTGGCCG CACCATTGCT GGCGCTGCAC CTTGGTTAGC ACTTGATACA
CCCGGCCCAG AAAAAGCCAC ACGCAGCAAG CTAAGAGACC AAGCCATTGC GGCCTATGAA
AATTCCGTGA ACCCCAAAAG CCCCGATTAC CTCGACTGGC AAGTAGGCCA CGGCCAAATG
TTGGTGGACT CTGCCTACTA CACCCAAGCA CTTATACGTG CGCCTATTTT GTGGCAAAAG
CTTACCCGTA AAACTCAGCA GCGTATTGTA AAAGAGATAA AGGCGTTGCG AAAAATTCCA
CCGCCCTACA CCAATTGGCT GCTATTCGCC GCCATGAACG AAGCATTTTT AATGCAAGTA
GGGGAAGAGT ACGACCCCAT TCGACTCGAT CTAGCACTGC GAAAATTTTT AGAGTGGTAC
GTAGGCGACG GATGGTTTGC AGATGGCGAG CACTTCGCGT TTGACTACTA CGGCTCTTAC
GTTATTCACC CCATGCTGTT AGATATATTA GAAGTGATGG CTGCCCACAA CACCTACTTT
TGGCACGGGG ACATCAAAGA CGTACTGGCA ACTCATTTAA AACGCAATCA ACGCTTCGCC
GAACATTTAG AGCGCTTGAT TTCACCTACG GGTACCTACC CACCTATAGG GCGCTCATTT
ACCTATCGCA CCGCGGCTTT TCAACCACTC GCGCAACTAG CACTAAAACA CAAGCTGCCC
GATAGCTTAC CGCAAGGCAG AGTGCGCGCA GCAATGCGCG CCGTTCACGA AGCCATTTTC
AGCAACCCTT CAAACTTTAG CAAAGAGGGG TTTTTAAAAA TTGGTTTTGC AGGCGCCGAC
CTTTCGCTTG CCGATTGGTA TTCCAACAAT GGCAGCATGT ACATAACAAC CGCAAGCTTT
TTACCCCTTG GGCTACCACT TAGCGACCCC TACTGGCAGG TACCAGGCGA AGATTGGACA
CAAAAACTGG CGTTTAGCGG GCAGAAATTT AAGAAGGATT ATTCAGTTTC TTATTAA
 
Protein sequence
MKRRDFLNAA ALGGAAYMTG LPQFANAKTL TTGQQDHTYF ANLLEKICSP ILHLMANEQF 
HAKFPLEVGA NSDGRDHRVA YLECFGRTIA GAAPWLALDT PGPEKATRSK LRDQAIAAYE
NSVNPKSPDY LDWQVGHGQM LVDSAYYTQA LIRAPILWQK LTRKTQQRIV KEIKALRKIP
PPYTNWLLFA AMNEAFLMQV GEEYDPIRLD LALRKFLEWY VGDGWFADGE HFAFDYYGSY
VIHPMLLDIL EVMAAHNTYF WHGDIKDVLA THLKRNQRFA EHLERLISPT GTYPPIGRSF
TYRTAAFQPL AQLALKHKLP DSLPQGRVRA AMRAVHEAIF SNPSNFSKEG FLKIGFAGAD
LSLADWYSNN GSMYITTASF LPLGLPLSDP YWQVPGEDWT QKLAFSGQKF KKDYSVSY