Gene Plim_2074 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlim_2074 
Symbol 
ID9138777 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePlanctomyces limnophilus DSM 3776 
KingdomBacteria 
Replicon accessionNC_014148 
Strand
Start bp2689046 
End bp2691166 
Gene Length2121 bp 
Protein Length706 aa 
Translation table11 
GC content53% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003630100 
Protein GI296122322 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.838738 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATTTTGC CGCACCGTTT TCGCTTTCCA GTCATTTGGG CATTTACGCT CAGTGTGATC 
ACTCTTCCGT TGGCCAATTC AGTGAGAGGT CAGGTCTCGC CAGGTATAGC CAAAACGTAT
GACATCGTGG TTTATGGTGG GACTTCGGGC GGGATTGCCG CTGCGATTCA GTCGGCCCGA
TTGGGAAAAT CGGTTGTCGT CATTGAGCCA TCGAATCATC TGGGAGGATT AACGACGGGT
GGACTCGGTG CTACTGATAT TGGAAATAAA GCGGCCATTG GTGGACTCTC GCGTGAGTTC
TATCAGCGAG TCGGAAAGTG GTATGCCGAC CCCAAAAACT GGGTGCACGA AAGCGCGTCT
AAATACAATG AGCGGCGTAA ATCGAGTGGT GAGAATGAAA TGTGGACCTT CGAACCACAT
GTCGCGACGA AGGTTTATAA AGACTGGCTG GCTGAATACC CCACGATTCA AGTCGTGATG
AATGAGCGGC TCGATCTCAA GAACGGTGTG AAGAAAAATC CCGCAACCAA AGCCATTGAA
TCGATCAGCA TGGAATCGGG GAAGGTATAC GCGGGTAAGG TGTTCATCGA CGCCACCTAT
GAAGGCGACC TGATGGCCAA AGCTGGCGTG CCTTATCATG TGGGGCGTGA GGGCGAAAAA
GTTTACGGCG AGACACTGAA CGGGATTCGC GTGGAAAAAT CGACTCACCA TCAGTTCACA
CACAAGGTTG ATCCTTATGT GATTCCGGGC AAGCCAGAGA GTGGGCTGAT TCCTCTGATT
CAGGCCGGCG GCCCTGGAGA AGAAGGTGCT GGTGATCATC GCGTGCAGGC TTACAACTAT
CGCATGTGTA CGACCGATGT GGCGGAGAAT CGCCGTGCCT GGCCAAAGCC CGAAGGTTAT
GACGAAAAAA CGTATGAGCT GGTGCTCAGA AACTGTGAGG CGGGTGATCA TCGCAAGTCG
TGGAATCCTG TCTGGATGCC CAATCGTAAG ACCGACACGA ATAACAATTT CGCAGTTTCG
ACAGATTACA TTGGGGCAAA CTACGAGTAT CCCGATGCCG ACTATGCCAA ACGGCAGGCC
ATCATCGATG ATCACAAGCG ATACCAGCAG GGTCTGATGT GGACGCTGGC GAATCATCCC
CGCGTACCGC AGGAGATTCG CGATCATTTC CAGAAGTTAG GGCTGGCCAA AGACGAGTTT
GTGGAGACCG ATAACTGGCC GCCGCAGCTG TATGTGCGTG AGGCTCGTCG CATGATCTCG
GACTATGTGA TGACTCAGCA CAATTGCCAG CGTAAAGTCA CAGCCGAAGA TTCCGTCGGA
ATGGGTGCCT ATAATATGGA TTCGCACAAC TGCCAGCGGT ATGTGACCAA AGAAGGCTAC
GTGCGGAACG AAGGCGATAT TCAGGTGGGT GTCCCCCCCT ACCCCATTTC CTACAAGAGC
ATTCGCCCGG CGAAAGAGCA CGTCACCAAT CTGCTGGTGC CGGTCTGCTT GTCGGCCTCG
CACATCTCTT ATGGTTCGAT TCGCATGGAG CCTGTCTTTA TGGTGCTCGG GCAGTCGGCA
GCGACAGCAG CCAGCTTTGC CATCGATGGC AAAACCACCG TTCAAGATGT GGATTATGCG
AAATTGCGGG AGAAGCTGCT GGAAGACAAG CAGGTGCTGG AATGGCAAGG TTCTCGCGGT
GGAGCAGCGG GGATTGTGCC AAGTTCATTG CCTGGGGTTG TCGTCGATGA CGCGTTGGCA
AGAACCAAGG GCGAATGGAC AGCCAGTGGT TCGATCAGCG GATTTGTCGG CAGTGGATAT
CAGACCGATG GAAACGAGCA GAAGGGGGAG AAATCGGCTC TTTTCGAACT GAAGATTCCC
AAGACAGGGA GTTATGACAT TCGTATGAGT TGGACGCCGA ATGCCAACCG CGCGAGTAAT
GTCCCAGTGG TGGTTGAAGC AGGGACGTTA CGAACCGAGA CCAGAGTCAA TCAGCGAAAT
GCTGCCGGGA AAGATGGCTT TCATACTCTG GGCCGCATGA CGTTAAATGC GGGGCAGACG
GTGAATGTCG TCATCTCGAA TGACAAGACC GATGGCCACG TCATTATCGA CGCAGTGCAG
GCACTTCTTG CTCAAGACTA A
 
Protein sequence
MILPHRFRFP VIWAFTLSVI TLPLANSVRG QVSPGIAKTY DIVVYGGTSG GIAAAIQSAR 
LGKSVVVIEP SNHLGGLTTG GLGATDIGNK AAIGGLSREF YQRVGKWYAD PKNWVHESAS
KYNERRKSSG ENEMWTFEPH VATKVYKDWL AEYPTIQVVM NERLDLKNGV KKNPATKAIE
SISMESGKVY AGKVFIDATY EGDLMAKAGV PYHVGREGEK VYGETLNGIR VEKSTHHQFT
HKVDPYVIPG KPESGLIPLI QAGGPGEEGA GDHRVQAYNY RMCTTDVAEN RRAWPKPEGY
DEKTYELVLR NCEAGDHRKS WNPVWMPNRK TDTNNNFAVS TDYIGANYEY PDADYAKRQA
IIDDHKRYQQ GLMWTLANHP RVPQEIRDHF QKLGLAKDEF VETDNWPPQL YVREARRMIS
DYVMTQHNCQ RKVTAEDSVG MGAYNMDSHN CQRYVTKEGY VRNEGDIQVG VPPYPISYKS
IRPAKEHVTN LLVPVCLSAS HISYGSIRME PVFMVLGQSA ATAASFAIDG KTTVQDVDYA
KLREKLLEDK QVLEWQGSRG GAAGIVPSSL PGVVVDDALA RTKGEWTASG SISGFVGSGY
QTDGNEQKGE KSALFELKIP KTGSYDIRMS WTPNANRASN VPVVVEAGTL RTETRVNQRN
AAGKDGFHTL GRMTLNAGQT VNVVISNDKT DGHVIIDAVQ ALLAQD