Gene Sde_3042 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSde_3042 
Symbol 
ID3967706 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSaccharophagus degradans 2-40 
KingdomBacteria 
Replicon accessionNC_007912 
Strand
Start bp3889329 
End bp3890357 
Gene Length1029 bp 
Protein Length342 aa 
Translation table11 
GC content49% 
IMG OID637922139 
ProductLacI family transcription regulator 
Protein accessionYP_528511 
Protein GI90022684 
COG category[K] Transcription 
COG ID[COG1609] Transcriptional regulators 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000688276 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAACCAA CAATTAAAGA TGTAGCTAAG CTCGCAGGGG TATCGTTTAA AACTGTTTCG 
CGCGTAGTAA ACAAAGAGAG TACCGTTGGT GAAGCCCTGC AAGAAAAAGT ATGGAAAGCG
ATTAACGAGC TTGGTTATAA GCCTAATTTG TCTGCCCGTG GTTTGCGTGG CGCTGCATCG
TCCATAGGTT TTATTTACGA TAACCCCAAC AGCAACTACG TAATCGATAT GCAGCGCGGT
ATTCTTAACG AATGCCATAA GCGCGGCTAT GAGCTAGTTA TTCACCCGTG TAATGCATCT
GGCGAGCACA TTATTGATGA AGTGATCGAA ATGATCGATC GCAGCCGGGT AGGGGGCCTA
GTGCTCACAC CGCCTATTTC CGAAAACCCC GAAATACTCG CAGCTATTGC TAATAAAAAA
GTCGAATTCG TACGTATTTT ATCTGGCAGC GCCGCACCAG ATACATTGTC GCCTTGTGTT
TACATCGATG ACCGCACAGC GGCTTACACA ATTACGCAGC ACTTAATCGA TTTAAACCAC
AAAGATATCG CCTTTTTGGG CGGTGATGAA GAGCATAAAT CCAGTGGCGA ACGTTTGGAA
GGCTACCGCT CTGCCTTAGC AGATAACGGC ATCACCCCCC ACGAAAACCA TATATTACCC
GGTAAATACT CGTTTGAATC TGGAGTGGAG CGCACCCGTG CGTTACTCGA GCTAGATGGC
CCACGCCCAA CCGCGGTGTT TGCCTGTAAC GATGAAATTG CAGCGGGTAC CTTGTTTGCT
GCCCGTATTG CGGGTGTAGA TGTACCAAAT CAGCTCTCCA TAGTGGGGTT CGAAGATAGC
CCCTTTTCGC GCCAAGCCTG GCCAAACCTT ACTACGGCCC AGCAACCCAC TAGCACCATT
GCGCAGCGTG CCACTGCACT ACTAATTGAC ACCTTAAAGA GCCGCGCTGA AGGCTCGCAA
GTTGTTGAAA GTGAAGGGTT TTTACCTAAA CTTATTGTGC GCGACTCCTC CCAAACTGCC
CCAGTATAA
 
Protein sequence
MKPTIKDVAK LAGVSFKTVS RVVNKESTVG EALQEKVWKA INELGYKPNL SARGLRGAAS 
SIGFIYDNPN SNYVIDMQRG ILNECHKRGY ELVIHPCNAS GEHIIDEVIE MIDRSRVGGL
VLTPPISENP EILAAIANKK VEFVRILSGS AAPDTLSPCV YIDDRTAAYT ITQHLIDLNH
KDIAFLGGDE EHKSSGERLE GYRSALADNG ITPHENHILP GKYSFESGVE RTRALLELDG
PRPTAVFACN DEIAAGTLFA ARIAGVDVPN QLSIVGFEDS PFSRQAWPNL TTAQQPTSTI
AQRATALLID TLKSRAEGSQ VVESEGFLPK LIVRDSSQTA PV