Gene Sde_3031 details


Gene Information

Locus tag: Sde_3031
Symbol:
ID: 3967695
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Saccharophagus degradans 2-40
Kingdom: Bacteria
Replicon accession: NC_007912
Strand:
Start bp: 3871493
End bp: 3872512
Gene Length: 1020 bp
Protein Length: 339 aa
Translation table: 11
GC content: 50%
IMG OID: 637922128
Product: AraC family transcriptional regulator
Protein accession: YP_528500
Protein GI: 90022673
COG category: [K] Transcription
COG ID: [COG2207] AraC-type DNA-binding domain-containing proteins
TIGRFAM ID:
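
The length fields above follow from the coordinates if Start bp and End bp are taken as 1-based and inclusive. That convention is an assumption here, since the record does not state it. A minimal Python sketch of the arithmetic, with illustrative variable names:

```python
# Coordinates copied from the record; 1-based inclusive numbering is assumed.
start_bp = 3871493
end_bp = 3872512

gene_length = end_bp - start_bp + 1   # inclusive span in bp
codons = gene_length // 3             # complete codons in the CDS
protein_length = codons - 1           # the final codon is a stop codon

print(gene_length, protein_length)    # 1020 339
```

Both values agree with the Gene Length and Protein Length fields.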


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000013989
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000160975
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGACCGCCG CACCTATCAA TACTAATCCG AGCAGCCGCG CCCTTACGGC CCCAGCACTC 
GAGGCGCTAG CGAACTACGC AGGGTGCAGA AACGCCACAA TCCACAGCAA AGGCGCAAAA
ACAGACACGT TAATTAACGG CCATTTCACC TTAAAGCGGG AGTTCGATGG CATATTGGTG
CATGCCTGCA ATATGGAAGA ACAAAAAGAT GCCCTAGTAG CCAGCCAGCA ACACGCTGGG
CTTACCTTTG GTGTATTGAT AGAAGGGAAA ATAACGTTTG GCTTTAATGG AGAGTTCGGC
ACTATAGAAG CTGCAAACGG CGCCCAGGGT TGGGCCACTA ACCTCACCCA AAACGCCGCA
TGGCAGCGCA AACTGACCAA CAAGCAGCAA GTGATTAAAC TTGTGGTTTC TGTACCGCCG
CAATGGATAA AACAGCACCT GTGGCAAAAC CCCGCCCCAG CCTTTTTAAA TAGGTTTATA
AGCACCCATT TAGCGCGAAC ACATTGGCTG GCATCCGGCA GTTTAGTGCG CCACGCAAAA
GCGGTAATGA GCAGCCACAG TAATAGCCCC AGTCAAGCCC TGCACTTCCA CGCCAATGTA
CTCGCATTTA TTGCGCAAGC ACTGGACGAC ATTGAAGCCA GCGGCGAGCG CATTTTTAAC
TTGCACAACC CTAACCGAAC CAGCCGGAGC CTTAGCAGCC AAGCTATAAA AGTGCAGCAG
CATTTAGAGC ACTGCATTAA TGAGCTGCAG CCGGGTGCTC ACATTCAATT GGAAGATATA
TCACATGCGC TGGGCATGAG TGTAAGCAAA TTGCAGCGTT TATCGAAAGC ACACTTTGGC
TGCACTATTG CCGAGTACAT TCGTATTCGT CGCCTAGAAA AAGCGCGCCA CGAAATTCAG
CACAACAATT TAAGTATTGG CGAAGCTGCT TTTTTAGCCG GCTATAATCA CAGGTCCAAC
TTCTCTAAGG CTTTCAAACA ATATTTCAAT TTATGCCCCG GCGACATAGC GCCGCAGTAA
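
The GC content field can be recomputed from the gene sequence. The sketch below is one way to do it, not necessarily how the database derives its figure. The helper name `gc_percent` is hypothetical, and the whitespace handling assumes the 10-base block layout used in this record.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)

# First row of the gene sequence, used only as a short example input.
first_row = "ATGACCGCCG CACCTATCAA TACTAATCCG AGCAGCCGCG CCCTTACGGC CCCAGCACTC"
print(round(gc_percent(first_row), 1))
```

Passing the full gene sequence instead of `first_row` gives the value to compare with the GC content field.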
 
Protein sequence
MTAAPINTNP SSRALTAPAL EALANYAGCR NATIHSKGAK TDTLINGHFT LKREFDGILV 
HACNMEEQKD ALVASQQHAG LTFGVLIEGK ITFGFNGEFG TIEAANGAQG WATNLTQNAA
WQRKLTNKQQ VIKLVVSVPP QWIKQHLWQN PAPAFLNRFI STHLARTHWL ASGSLVRHAK
AVMSSHSNSP SQALHFHANV LAFIAQALDD IEASGERIFN LHNPNRTSRS LSSQAIKVQQ
HLEHCINELQ PGAHIQLEDI SHALGMSVSK LQRLSKAHFG CTIAEYIRIR RLEKARHEIQ
HNNLSIGEAA FLAGYNHRSN FSKAFKQYFN LCPGDIAPQ
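
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below assumes the codon assignments of the NCBI standard genetic code. Translation table 11 uses the same assignments and differs only in which codons may act as start codons, which does not matter for this gene because it begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Amino acids in NCBI codon order (bases T, C, A, G at each position).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    bases = "".join(dna.split()).upper()  # drop the 10-base block spacing
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First ten codons of the gene sequence.
print(translate("ATGACCGCCG CACCTATCAA TACTAATCCG"))  # MTAAPINTNP
```

The output matches the first block of the protein sequence. Translating the full gene sequence the same way gives the string to compare with the 339 aa entry.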