Gene Plim_3906 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlim_3906 
Symbol 
ID9140624 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePlanctomyces limnophilus DSM 3776 
KingdomBacteria 
Replicon accessionNC_014148 
Strand
Start bp5020019 
End bp5021248 
Gene Length1230 bp 
Protein Length409 aa 
Translation table11 
GC content57% 
IMG OID 
Producthelix-turn-helix- domain containing protein AraC type 
Protein accessionYP_003631916 
Protein GI296124138 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGCAAG AAGCCCCCGC TTCCCAGGAT TCGTCCGAAA TTTCTCCGCG TCCAGCGACA 
TTGCCGGTAT CGGCCATGCC ACCCACCATG CATCGCAGTG TGGCACTCAT CGTCCATACC
TCATCTGACT GGACGAGGCA GGTCTTGCGC GGGATTGCCC AGTATGCCAG CGAGCACGGC
TTCTGGGATT TCTTCATCGA GCCACGCGGT CACGACGAAA AACTGCTGAT TCCCAAGGGC
TGGAGTGGTG ATGGCATTAT TGCCCGCCTC ACTCACCCCG CACTCGAAAA GCAGATCCTC
AAAAGCAACC TGCCCTGCGT CAACGTCTCC TGGATGGGAC AGCACTCCCT CCGGATCCCC
AAAGTCGTCT CCGATGAAGC GGCTTGCGGA AGGCTCGCTG CCGAACATTT CCTCGAACAA
TCCTTCCGCA GCTTCGCCTA CATCGGGCCG ATTCATCGTG CGGGATATGA CGATCTGCTG
GGGAAAAACT ACATCAATAC GCTATTGCAA GCCGGTCATC AGACCAGCAT TTATCAGCCC
ACAGTTCCCA TTCCTGTCCC TGACCTTTTA ACTCAACGAA AAGGTTTTCT GCAGTGGCTG
AAAATGCTCC CCAAGCCAAC GGCAATCTTC ACCTGGAGTG GTGAAGTGGG CCGCGAGTTA
ATGACCTGTG CCCGCCTCAG CCGCTTTCGG ATTCCCGACG ATATCGCCCT GCTCGTTGGT
GAAAACGATC CACTGTTGTC AGCTCTGGCA CCAGTCCCGC TTTCCAATAT CGACCAGGCT
CCTGTCCGTG TCGGTTACGA AGCAGCGACT TTGCTCGACA AACTCATGAA TGGCGAACCA
GCACCGGAAG AGCCGATTCT TGTCCCCCCT GTCGGTGTGG TGCAGCGCCG ATCCACGGAA
ACATCGGCTG TCGATGATCC TCTGGTCGAT ATGGCCGTTC GTTTTATGCG CGACCATTTG
AGCGAACCGA TTCAGATTGC GGATGTGGAG AAAGCTCTCA ATGTCTCCCG CCGGGTGCTG
GAACACCGCT TCCATAAAGT GCTCGACGAT ACCCCAGCCA ATGTCCTTCG CAGGATGAGA
TTGCAGAACG TCAAACGCCT CCTGGGAGAA ACCACTCTTC CACTGGCCCG CATTGCCCAG
CTCACTGGCT TTAATCACGT GGAAGTTCTG GTGCGTACCT TCCGCCGTGA GCTGGGGGTC
ACTCCCGGCG AATACCGGCG TCGCCACTAA
 
Protein sequence
MSQEAPASQD SSEISPRPAT LPVSAMPPTM HRSVALIVHT SSDWTRQVLR GIAQYASEHG 
FWDFFIEPRG HDEKLLIPKG WSGDGIIARL THPALEKQIL KSNLPCVNVS WMGQHSLRIP
KVVSDEAACG RLAAEHFLEQ SFRSFAYIGP IHRAGYDDLL GKNYINTLLQ AGHQTSIYQP
TVPIPVPDLL TQRKGFLQWL KMLPKPTAIF TWSGEVGREL MTCARLSRFR IPDDIALLVG
ENDPLLSALA PVPLSNIDQA PVRVGYEAAT LLDKLMNGEP APEEPILVPP VGVVQRRSTE
TSAVDDPLVD MAVRFMRDHL SEPIQIADVE KALNVSRRVL EHRFHKVLDD TPANVLRRMR
LQNVKRLLGE TTLPLARIAQ LTGFNHVEVL VRTFRRELGV TPGEYRRRH