Gene Plim_2000 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlim_2000 
Symbol 
ID9138702 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePlanctomyces limnophilus DSM 3776 
KingdomBacteria 
Replicon accessionNC_014148 
Strand
Start bp2595031 
End bp2596218 
Gene Length1188 bp 
Protein Length395 aa 
Translation table11 
GC content54% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003630029 
Protein GI296122251 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTGATC CAACGCCGAT TACGACTTTC GAGCCCGAAC CTGAACCCGA GCGACCAGCG 
GCTGGAGGCA GCTCACGAGT CATCAACGAT GGTCAGGCTC GTGTCTTTCC TTGCAGTCAA
TGCGGGGCTG ATCTCGAATT CCGGATTGGT ATCCAGCAGC TTCAGTGCCC ATTTTGCAAT
CATGTGGAAC AACTGGAGAT CCCTGCCGAT GCTGCCATCG TCGAGCAGGA TCTCGACGCC
ATGCTTGAGC GATTGCAAGA GCAGCATGGA GATCTCGAAT CCGCGGAAGG AGAGGTCAAT
AGTGCCACCG TCGGCGACCA GGAAGTTCAT TGCGATGGTT GCGGCGCAAA CGTCCTCTTC
GTGGGCACTC TCACCAGCAG CCGCTGCCCT TACTGCGGCA GCCCAATTCA ACGTAATGAT
GTCCATAAAT CCGCAGCTCG CATTCCGGTC GATGGGGTGC TTCCCTTTTT TATCGTTCGC
GAGAAAGCCG CCAGTTGTAT CGAGCAGTGG GTTCAATCCC GCTGGTTCGC TCCTAACGAT
TTCAAGAAAC TGGGAGCCAA AGGGAAATTC GAAGGCGTTT ATCTCCCTTA TTTCACATTC
GATGCCATGA CATTTAATCG TTATCAGGGC GAACGTGGCG ATCGCTACAC TGTCACTGTC
GGAACTGGTA AAGATCGACG CACTGAAACC CGCACAAGAT GGTCTTACGC TTCCGGTCAG
TTCCAGCGAT TCTTTGACGA TGTGCTGATT CTCGCGATTC GTTCGCAGCG ACATGATCTT
GCTCAGCATC TGGAACCCTG GCCTCTGGAA AAATGCGTCC CCTTTACCCC CGATGCCATG
GCTGGAATTT TTGCGAGAAC GTATGACATT CCGCTCGATC AATCGTTCGA ACTGGGCCAG
CAAAGAATGC GGCAAGCCTT AATGGCCGAA ACCCGCCAAA GGATTGGTGG CGACGAGCAA
CGTGTCCATG ACCTGAAAAC TCAGTTTACG GCACTCACGT TCAAACACCT TCTGTTGCCT
GTCTGGTTGC TGGCCTACCG CTATCGCGAC AAGACCTACC CCGTCATGGT GAATGCCGTC
ACAGGCGAAG TGAGTGGCGA TCGCCCTTAC AGTTGGATCA AGATTACTCT GGCCATCCTC
GCTGCGGCAG CCGCGGCATT GACTCTGTTT GCATTGACAC AAAAGTGA
 
Protein sequence
MSDPTPITTF EPEPEPERPA AGGSSRVIND GQARVFPCSQ CGADLEFRIG IQQLQCPFCN 
HVEQLEIPAD AAIVEQDLDA MLERLQEQHG DLESAEGEVN SATVGDQEVH CDGCGANVLF
VGTLTSSRCP YCGSPIQRND VHKSAARIPV DGVLPFFIVR EKAASCIEQW VQSRWFAPND
FKKLGAKGKF EGVYLPYFTF DAMTFNRYQG ERGDRYTVTV GTGKDRRTET RTRWSYASGQ
FQRFFDDVLI LAIRSQRHDL AQHLEPWPLE KCVPFTPDAM AGIFARTYDI PLDQSFELGQ
QRMRQALMAE TRQRIGGDEQ RVHDLKTQFT ALTFKHLLLP VWLLAYRYRD KTYPVMVNAV
TGEVSGDRPY SWIKITLAIL AAAAAALTLF ALTQK