Gene Sde_4002 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSde_4002 
Symbol 
ID3967421 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSaccharophagus degradans 2-40 
KingdomBacteria 
Replicon accessionNC_007912 
Strand
Start bp5038294 
End bp5039379 
Gene Length1086 bp 
Protein Length361 aa 
Translation table11 
GC content43% 
IMG OID637923099 
ProductAraC family transcriptional regulator 
Protein accessionYP_529469 
Protein GI90023642 
COG category[K] Transcription 
COG ID[COG2207] AraC-type DNA-binding domain-containing proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.24994 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTAAATG CAGCTAATAC GGTTAATAGC AAGATGGCTT ATACACAGGC GGCCTATTCG 
CAGGCGCTAA GCTCTAGCGC TCAAACGCAC TACCTGTTAC CCAGCGATAA AACCATCGTC
GCGCATTGGC AGCCAGCCAT ATTGTTAAAT TTAATGTGCA ATAGTTTGGC TAATGAAGAA
GCTATTGCGC TTTCTAATAA GCTTTTAAAG GGCACGCGCC TTTTTTATAG TGATTTTAGT
AAAACAAACC TGTTTATTAG CCCGGAGCAG TTTCAACGTT TTATTGTAAA TTGCAATGCT
ACCCCCAACC AATCCGATTT GGCATTTAGG TTTGGCCAAC GTTTATTGCC CGGTCATTAC
GGTGATTTTT CCCACGGCTT AAACCAAGTA AACAGCGTAT TCGCTGCCGC AGAATTAATT
CAAAAGTGCG CACATGTTTT CTCCCCATTG CTTACACCTA AGGTTAATGT GTATGCAACA
GAATTAACAA TTAGCTTTTA TTCAAGCTAT GGCTGCGGCA ATAGCCACAG GTTTGTGTGC
GAAGCATTTA TTTTTGCAAT TAAAAACTGG CTAGAGCAAC AACTGGGCAG GCACTTGCCT
TGGCAATTTG AGTTTAACTA CACCGCACCA GAAGCCATAG AAAATTACGA AGTGTACCTA
GGGGATAACC TTAAGTTTAA CCGACCCGTT ACCGCTATTC GCTTACCCAT TGAATATGCC
CATAGCAGCT GGCAAGTAAG CGAGAATTTT ACTTTACCCT GCGCAGCCAC CCCAGTTAGC
TTGCTAAATT TAGTGCGCCA ATTACTAAGA AACAACATAC AAGCCAACCC GAGTTTAGAG
TGGCTTGCAC AACAGTTAGA CATTAGCCCC GCCACATTAA AACGACGCCT TAAAGCCTGC
AATACGCAGT TTCGCGATTT ACTGAGCGAA ATTCGATTAG AAGTAGCGGT AGAACTTTAT
CAGCAACAAC ATTTTAGCAG CGATGCCATT TGCCAATACT TGGGCTTTTA CGACGAATCT
AATTTGCGCC GATTTTTTAA GCGCACCACC GGCCAAACAC ATACCCAGTA TTTGGCCTTA
ACCTAG
 
Protein sequence
MLNAANTVNS KMAYTQAAYS QALSSSAQTH YLLPSDKTIV AHWQPAILLN LMCNSLANEE 
AIALSNKLLK GTRLFYSDFS KTNLFISPEQ FQRFIVNCNA TPNQSDLAFR FGQRLLPGHY
GDFSHGLNQV NSVFAAAELI QKCAHVFSPL LTPKVNVYAT ELTISFYSSY GCGNSHRFVC
EAFIFAIKNW LEQQLGRHLP WQFEFNYTAP EAIENYEVYL GDNLKFNRPV TAIRLPIEYA
HSSWQVSENF TLPCAATPVS LLNLVRQLLR NNIQANPSLE WLAQQLDISP ATLKRRLKAC
NTQFRDLLSE IRLEVAVELY QQQHFSSDAI CQYLGFYDES NLRRFFKRTT GQTHTQYLAL
T