Gene Sde_3322 details

Gene Information

Locus tag: Sde_3322
Symbol:
ID: 3965871
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Saccharophagus degradans 2-40
Kingdom: Bacteria
Replicon accession: NC_007912
Strand:
Start bp: 4240777
End bp: 4242039
Gene Length: 1263 bp
Protein Length: 420 aa
Translation table: 11
GC content: 50%
IMG OID: 637922419
Product: RNA polymerase ECF-subfamily sigma-70 factor
Protein accession: YP_528789
Protein GI: 90022962
COG category: [K] Transcription
COG ID: [COG4941] Predicted RNA polymerase sigma factor containing a TPR repeat domain
TIGRFAM ID: [TIGR02937] RNA polymerase sigma factor, sigma-70 family
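
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is only an illustration: the variable names are made up for this example, and it assumes the start and end positions are 1-based and inclusive, which the record does not state but which agrees with the listed gene length.

```python
# Values copied from the record; the variable names are illustrative only.
start_bp = 4240777
end_bp = 4242039
gene_length_bp = 1263
protein_length_aa = 420

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A CDS ends with a stop codon, which is not counted in the protein length.
codons = span // 3
expected_protein_length = codons - 1

print(span, span % 3, expected_protein_length)  # prints: 1263 0 420
```

Under that assumption the span equals the listed gene length, is a whole number of codons, and gives the listed protein length once the stop codon is subtracted.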


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000453852
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCATTCTC AAGAAACCTC ATCACCGTTT GAAGATGACC TTCCTGCTTT AATTGACGCC 
ATATATCGCG CGGAGTCTCG CCGTATATAC GCCACGCTTA TTCGGCTAAT AGGAGACATG
CAATTGGCAG AAGAGGCTAT GCACGACGCC TTTCATGTCG CCTTGAGTCA ATGGCAGGAT
AAAGGCATAC CGGATAATCC GCGTGCTTGG TTGGTATCTA CCGCCCGCTT TAAGGCTATC
GACCAGTTGC GGCGGCAAAC TCATTTAGCT GAGACTCTAG AGGCTATTAC GCCCCTTAGC
GACAGTGAAG CGCTGGATTG GGATGGCGAT ATTATTGAGG ATGATCAACT TCGCTTGATT
TTTACCTGCT GCCATCCCGC GCTAGACCCC AAGTTACAAA TCCCGCTCAC TTTAAGAGAG
GTGTGTGGCC TAACCACTGA AGAGATCGCT AGTGCGTATT TAGTAACCCC ATCGACAATG
GCACAGCGTA TTGTTCGAGG AAAGGCCAAG ATTCGTGATT CAAAACTTCC CTTCGAAATT
CCTGAACGCT CGCAGTTGGC GCAGCGCTTA GATGCAGTAC TAGCTGTTGT GTATCTTCTG
TTTAATGAAG GCTACTCGGC CACTAAAGGT GATACTTTGC TTAGAGTGGA GCTGTCATCA
GAAGCGATTC GATTATCGCG GCAGTTGCTG GAGCTAATGC AAGATAGCGA GATAGAAGGT
CTGCTTGCAC TTATGCTGCT GCATCAGTCA CGTAGTGCCA GTAGAACAAA TTCTGCTGGT
GATATTATTT TGCTAGAGGA TCAAGACCGA AGTCTATGGG CGAAAGACTT GATAGATGAG
GGACGATTTA GAGTCGGCCG CGCTTTCGTT CTCGGGTCGG TGGGGTTCTA TACTTTACAG
GCGGCTATTT CAGCTTGCCA TGCACAGGCG CCCACTTGGT TGGAAACTGA CTGGCAACAG
ATTGTTCAGC TGTATGAGGC TCTGTCGCAG GTCGACCCAT CTCCTATCGT GGAGCTCAAC
AAAGCAGTCG CAGTCTCAAT GCTTGAAGGG GCAGAAGCTG GGTTGAAGAT CATTACACAA
TTGATCCGCG GTCAAGAGTT GGAGCAGTAT CACTTGCTCC ACGCTGCTCA CGGCGAATTG
CTGAGCCGAA CTGGGGAACT AATGGGCGCT CGTTCGGCTT TTGAGCGAGC GTTGTCGCTA
ACGAATCAGG AGGCCGAGCG ACGGGTGTTA AAACTAAAGA TGAGTAGGCT CGACGCCATT
TAA
 
Protein sequence
MHSQETSSPF EDDLPALIDA IYRAESRRIY ATLIRLIGDM QLAEEAMHDA FHVALSQWQD 
KGIPDNPRAW LVSTARFKAI DQLRRQTHLA ETLEAITPLS DSEALDWDGD IIEDDQLRLI
FTCCHPALDP KLQIPLTLRE VCGLTTEEIA SAYLVTPSTM AQRIVRGKAK IRDSKLPFEI
PERSQLAQRL DAVLAVVYLL FNEGYSATKG DTLLRVELSS EAIRLSRQLL ELMQDSEIEG
LLALMLLHQS RSASRTNSAG DIILLEDQDR SLWAKDLIDE GRFRVGRAFV LGSVGFYTLQ
AAISACHAQA PTWLETDWQQ IVQLYEALSQ VDPSPIVELN KAVAVSMLEG AEAGLKIITQ
LIRGQELEQY HLLHAAHGEL LSRTGELMGA RSAFERALSL TNQEAERRVL KLKMSRLDAI
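
The relationship between the gene sequence and the protein sequence can be reproduced with a short script. This is a minimal sketch, not the method the database itself uses: the function names are hypothetical, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in which codons may act as start codons). A GC fraction helper is included as well; the record's 50% figure may be rounded or computed differently.

```python
# Codon table in NCBI order (bases T, C, A, G). Translation table 11 assigns
# the same amino acids as the standard code; it differs only in start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Drop the spaces and line breaks that group residues in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence, copied from the record.
first_line = "ATGCATTCTC AAGAAACCTC ATCACCGTTT GAAGATGACC TTCCTGCTTT AATTGACGCC"
print(translate(first_line))  # prints: MHSQETSSPFEDDLPALIDA
```

The 20 residues produced from the first 60 bases match the start of the protein sequence listed in the record. Passing the full gene sequence to the same functions would give the complete translation and a GC fraction to compare with the listed value.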