Gene Sde_2374 details


Gene Information

Locus tag: Sde_2374
Symbol:
ID: 3967368
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Saccharophagus degradans 2-40
Kingdom: Bacteria
Replicon accession: NC_007912
Strand:
Start bp: 3005122
End bp: 3006501
Gene Length: 1380 bp
Protein Length: 459 aa
Translation table: 11
GC content: 50%
IMG OID: 637921465
Product: hypothetical protein
Protein accession: YP_527846
Protein GI: 90022019
COG category:
COG ID:
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
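
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the record; assumed to be 1-based, inclusive coordinates.
start_bp = 3005122
end_bp = 3006501

# Inclusive span: both end positions count toward the length.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the last codon is the stop
# codon and contributes no amino acid (assumption: exactly one stop codon).
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # 1380 459
```

Both results match the Gene Length (1380 bp) and Protein Length (459 aa) fields.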


Plasmid Coverage information

Num covering plasmid clones: 35
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.060586
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCAAAT TATCACAACC ATCTCGACGT AAGTTCCTCA TGGGGTTAGC TAAAACTGGC 
ATGCTGCTAC CGTTTGCAGG GCAAATGTTA GGGCAAAAAG CATTGGCTGC AGGTACTGGT
GCGCAGCGGG TTTTATTTAT GTATTACCCT AATGGCGTTG TGCCCTATAA CTGGCGGCCG
CAGCAAGATA GTGGCGCTAT TACCACTACC AACGAGCTAA GCTTTGGTTT GGGGCCGTTA
AAAGATTGGC ACAATAAAAT GATTGTGTTT AAAAATTTAA ACCTAGACGT AGGCCAAGGT
GCTGGGGCGC ACTACAACGA TATGCGCGGT ATTTTAACCG GCGATAACCA AATAGGCGTC
GACGGCGCGA GTATCGATCA CTTAATTGCA GAGCGCTTGG GCGACGAGGG CGTATTAAGT
TTGGGTGTGC GTACTGGGCC TAAAAAAGAC ATTATGATCT CTAAACCGCG CGGTTATAAA
ACCGATAGCC GCCCAATACC GAATAACGAC CCGCGCGATG TCGCAAGTAA ATTGGCGCTG
CGTATTGGTC CATCAGACGG TTTATCCGAT AACAAAAAGG CTATGTACGA AGCTATATTG
TCAGACTTTG ATGACCTAGC CGATGCAACC CTAACTAATA CGCGGCAAAC CAAATTAGAT
TTTCATAGCA ATGCGTTAAT TCGCCTGCGC GATCAAGCCG GTACTAAAGT GGGGGAGTGT
GGTTTTAATA GTAACGTGAT GACCGACCCC TATTATGAAA CAACCCAAAG CTTAACCTCT
ACCGAGTGGC AAATGTTCCC GCACTTGGCT AGGGCGCAGG TAGATAATAT TGTTGGCGCC
TTCGCTTGCG GTTTGCATAA GGTAGCCACC TTGCAGCTTT CCAAAGGCGA TGAGAACGGC
AACTTAGTGA ATTACTCTTT CGATGAATGC TGGCAAATGG CCCAGGATGC AGTAGCACAA
GGTATTAAAC CCGATGCAAA CTCTGGCCCT GTAAGCGAAA TGACCCGCTG GTACAACGAG
CACGCCAGCC ACAGCGCATC CCATAGGCCG GGGGCTGTGC CACACACGGC GCAAGTGCGT
TGGTATCACT CATTGTTGGC CTATACCCTG CAGCAGCTGC AGGCTCGCGG TTTATTAGAC
GATACCCTTG TAGTGTTGTT TTCGGAAGTT GGCGAAGGTG CTAAGCACGG TGGGGCTGCA
GGTTCGGTAA CCCTTGCTGG CGGTGCTGCC GGCGATTTAG AAATGGGGCG CGTTATTTAC
TGTGGTGATG GCAATAACGA TGGTCACAGG GTGTGGGGCA GTCACCAATT ATTTGGTGAT
ATTGCACGCT TAGTGGGTGC TCCAGATATT GGCGGGCCTT GGTCTGGCGG CGTTATTTAA
 
Protein sequence
MSKLSQPSRR KFLMGLAKTG MLLPFAGQML GQKALAAGTG AQRVLFMYYP NGVVPYNWRP 
QQDSGAITTT NELSFGLGPL KDWHNKMIVF KNLNLDVGQG AGAHYNDMRG ILTGDNQIGV
DGASIDHLIA ERLGDEGVLS LGVRTGPKKD IMISKPRGYK TDSRPIPNND PRDVASKLAL
RIGPSDGLSD NKKAMYEAIL SDFDDLADAT LTNTRQTKLD FHSNALIRLR DQAGTKVGEC
GFNSNVMTDP YYETTQSLTS TEWQMFPHLA RAQVDNIVGA FACGLHKVAT LQLSKGDENG
NLVNYSFDEC WQMAQDAVAQ GIKPDANSGP VSEMTRWYNE HASHSASHRP GAVPHTAQVR
WYHSLLAYTL QQLQARGLLD DTLVVLFSEV GEGAKHGGAA GSVTLAGGAA GDLEMGRVIY
CGDGNNDGHR VWGSHQLFGD IARLVGAPDI GGPWSGGVI
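
The sequences above are laid out in blocks of ten characters, so they need whitespace removed before any analysis. The following is a minimal sketch of that cleanup, plus a GC fraction calculation and a translation using the codon assignments of translation table 11. The helper names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical, and the sketch does not handle the alternative start codons that table 11 allows, which is not an issue here because the gene begins with ATG.

```python
import re

# Codon assignments for translation table 11, built from the usual
# T, C, A, G ordering of the three codon positions.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text):
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return re.sub(r"\s+", "", text).upper()


def gc_fraction(dna):
    """Return the fraction of bases that are G or C."""
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence, copied from the record.
first_blocks = "ATGAGCAAAT TATCACAACC ATCTCGACGT"
dna = clean_sequence(first_blocks)
print(translate(dna))  # MSKLSQPSRR
```

The translated fragment matches the first block of the protein sequence. Pasting the full gene sequence into `clean_sequence` should allow the same comparison against the complete protein and the reported GC content.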