Gene Plim_3801 details

Gene Information

Locus tag: Plim_3801
Symbol:
ID: 9140519
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Planctomyces limnophilus DSM 3776
Kingdom: Bacteria
Replicon accession: NC_014148
Strand:
Start bp: 4883841
End bp: 4886147
Gene Length: 2307 bp
Protein Length: 768 aa
Translation table: 11
GC content: 56%
IMG OID:
Product: Thioredoxin domain protein
Protein accession: YP_003631812
Protein GI: 296124034
COG category:
COG ID:
TIGRFAM ID:
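
The start, end, and length values in this record are related by simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS ending in one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(4883841, 4886147)
print(length_bp, protein_length(length_bp))  # prints "2307 768"
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields above.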


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGATCGAG ATTCGACCAT TGCACGCCAT GTTTCGCCCA CGACCGAGTC GTTTGCAGGA 
CAGGTTCCCG GCTGGTACGG AGAAATCCCA TCGGCTGATC AACCGGTGCC TGTTGCCGAT
AGCTCCCGTC GGCCTGCGGA GGTGAACTCC CTCCCTGAGA CAAAGCTGAC TCAGGAAGCA
TCGAGAGTTT CCCGCTCAGC CGTAGTCTCT GAATCGGGCG TTGAATCTTC CAATAAGTCA
TCTCTGCATT CTCGATCTCA CGATCAAGTT TCTAAGCCTG CGAAGACACA GGCCCATGCC
TTGCAGATTG CCCTGTCCCA TCGCTCGATC ATTGCTGAAG CGGATGCAAC AGCCGAGGCG
ACAACTCCGG AAGTACCGGT TGCTTCTGCT TTTGAGCCTG AGCTTCCAGC GACAATCCGC
AGTTCTCGAT TTTTCGTTCA CGGCAAACTT TCCCGCACAG CAGGTCTCGC GAGTGCCATG
GCTCTCACGA TGACCTTGCT AGTGGCCGCC GTGGTTGCCA TTGTCTGTCC CACGACATGG
CCGGCTGGTC TGGGGGGTTC GAATCAGACG CTGAAAGCTG CAGGGACACC GGCCAACTCT
GTCGTACTCG ACTTTACCGC TTCCTGGTGC GGGCCCTGTC AGCAGATGAG CCCGATTGTC
TCGAAACTGC AAAGACAAGG TTATGCGATT CGCAAAGTGG ATGTGGATCA GGAACCCAAT
CTCGCCCGTC AGTTTCGCGT TGATTCCATG CCCACATTTA TCCTCGTGAT CGACGGCAAA
GAAGTGAATC GCATTGTCGG CATGACCAAC GAAGCTCAAT TGCGACGGAT GGCTGAGCAG
GCCGTGCAGG GTTCACAACT CGCACGTGGT CAGCAGGAGC GACTGGCAGT TCCAGGTTCA
AAACCACTGC CCAAGACCAA TAGTTCACCA TTCGATGCCG ACGGCGTACC CACCACGGTG
TTGGGTGAAT CATCGCGTCT GGAAGCAGAA ATTGTCTCCA ACCCTCGCAA TGCCCCCGCC
GCTCCCCGCA CTGGAATTCT CAATTCGATT AGCGAAGGCA TGGGCCTGAC CCGATCGGCC
AATCCTTCTA TTCCTGCAGA ACCTAAGCCA CCGGCAGCTC CCGCCAACGA ACTCTTTCGC
GGGAACAATG CAGAAGACGA TGCCGAAATG ACAGTGGCTC ATGTTGATCC CACTTTGGCC
AGTGTGCGTA TTCGAGTTCG CGACAGCCAG GGTGGAATGT CGAACTATGG ATCGGGAACC
ATTGTCGATA GCCAGGCTGG CCGCACGACG ATCATCACCT GCGGGCATAT TTTCCGTGAT
CTGAAATCCA AGCCTGTTGT CGAAGTGGAT GTCTTCCAGG CTCAAGGGAG TCCTCGGACC
TACCTTGGCG AAGTGATGGA TTTCAATCTC GATGCGGATG TGGGTCTGAT CATGATTCCC
ACCGAAAAGG CCGTTTCTTT GGCCCGGCTA TCCCCTGTGG ATGTGAGACT CGGGCCGGGC
GAAGAGATGT TCAGTATTGG CTGCGGTGGA GGCGCCAATC CTTCACGCGA AAGTCACAAG
GTGACAGCCA TCAACCGTTA CAACGGGCCG GAAAACATTG AATGCACAGG AGTGCCGATT
CAGGGTCGCT CCGGTGGCGG GCTGTTCCGA GCCGATGGTC AATTGGCTGG TGTGTGCATT
GCTGCCGATA CGAAAGAACG CCGTGGTCTT TATGCCGGTT TGGAACCCAT CTGCACCATG
CTCGAAAAGC ATCGGCTGGG TGCTCTCGTT CGACGAGAAG GTCGAGGCCG GGCGAATACG
GCTGTCGCCA GTGCCGCCCG CACGGCTGAT GCAGGAAGTT CGTCTGAAGC CATGGGTGAC
ATCCTGATGG CTGGGGGTGG TAACAATCGC GCAGTTTCAG GTGCTGCGGG TAGTAATAAC
AGTGGCACGA ATGCCGCTGC TGAGGCCGCA TTGATGAATC AATCGCCCGA TGCGGAAATC
ATCTGCATTG TGCGTCCACG CGGTGGTGCG GGAGATAACA GTCGGATTGT GGTTCTGAAT
CGTGCCAGCC AGACGTTCAT GACCTATCTG ATGGGAGAGG TTGACTCTCA GTCGCAGCGT
CTGCCGGTTT CACTCCGAGT GGACGACAAC CAGCGGCAGA CAGCTTCGTC GGTGTCTGAA
AAATCAGTGG CCGCCGTTGA ACCTGCCAGA GAGCCTGCTC GTGAAGCTCT CGGGCCGAGA
CCGTGGGTTT CGAGCAATTC GACCGCCTTC AAGCGACAGG TTTTGCCGCG AAGGGAAGAA
CTTGTGCCTC TGACAGAACG GAAATAG
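
The gene sequence above is printed in blocks of ten bases, so it needs to be joined before any analysis. The sketch below shows one way to strip the block spacing and compute a GC percentage such as the one reported in the GC content field. Both function names are hypothetical helpers introduced for illustration.

```python
def clean_sequence(text: str) -> str:
    """Join a blocked sequence listing into one uppercase string."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two blocks of the gene sequence, as printed in the record.
first_blocks = clean_sequence("ATGGATCGAG ATTCGACCAT")
print(len(first_blocks), gc_percent(first_blocks[:10]))
```

Applying `gc_percent` to the full cleaned sequence gives the figure to compare against the rounded value in the record.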
 
Protein sequence
MDRDSTIARH VSPTTESFAG QVPGWYGEIP SADQPVPVAD SSRRPAEVNS LPETKLTQEA 
SRVSRSAVVS ESGVESSNKS SLHSRSHDQV SKPAKTQAHA LQIALSHRSI IAEADATAEA
TTPEVPVASA FEPELPATIR SSRFFVHGKL SRTAGLASAM ALTMTLLVAA VVAIVCPTTW
PAGLGGSNQT LKAAGTPANS VVLDFTASWC GPCQQMSPIV SKLQRQGYAI RKVDVDQEPN
LARQFRVDSM PTFILVIDGK EVNRIVGMTN EAQLRRMAEQ AVQGSQLARG QQERLAVPGS
KPLPKTNSSP FDADGVPTTV LGESSRLEAE IVSNPRNAPA APRTGILNSI SEGMGLTRSA
NPSIPAEPKP PAAPANELFR GNNAEDDAEM TVAHVDPTLA SVRIRVRDSQ GGMSNYGSGT
IVDSQAGRTT IITCGHIFRD LKSKPVVEVD VFQAQGSPRT YLGEVMDFNL DADVGLIMIP
TEKAVSLARL SPVDVRLGPG EEMFSIGCGG GANPSRESHK VTAINRYNGP ENIECTGVPI
QGRSGGGLFR ADGQLAGVCI AADTKERRGL YAGLEPICTM LEKHRLGALV RREGRGRANT
AVASAARTAD AGSSSEAMGD ILMAGGGNNR AVSGAAGSNN SGTNAAAEAA LMNQSPDAEI
ICIVRPRGGA GDNSRIVVLN RASQTFMTYL MGEVDSQSQR LPVSLRVDDN QRQTASSVSE
KSVAAVEPAR EPAREALGPR PWVSSNSTAF KRQVLPRREE LVPLTERK
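
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below is a minimal translator, assuming the amino-acid assignments of translation table 11, which match the standard genetic code (the tables differ in which codons may act as start codons, and that is not modeled here). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Compact codon table: codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the block spacing
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence.
print(translate("ATGGATCGAG ATTCGACCAT TGCACGCCAT"))  # prints "MDRDSTIARH"
```

The printed peptide matches the first block of the protein sequence, and translating the final line of the gene sequence yields the C-terminal residues LVPLTERK followed by the stop codon.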