Gene Plim_2036 details


Gene Information

Locus tag: Plim_2036
Symbol:
ID: 9138739
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Planctomyces limnophilus DSM 3776
Kingdom: Bacteria
Replicon accession: NC_014148
Strand:
Start bp: 2639062
End bp: 2640498
Gene Length: 1437 bp
Protein Length: 478 aa
Translation table: 11
GC content: 53%
IMG OID:
Product: Anthranilate synthase
Protein accession: YP_003630063
Protein GI: 296122285
COG category:
COG ID:
TIGRFAM ID:
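
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic, which can serve as a consistency check on the record. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the gene length includes the stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2639062
END_BP = 2640498
GENE_LENGTH_BP = 1437
PROTEIN_LENGTH_AA = 478


def span_length(start: int, end: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS: one per codon, minus the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(START_BP, END_BP)
computed_protein_length = protein_length(computed_gene_length)

# Both computed values should agree with the lengths listed in the record.
print(computed_gene_length == GENE_LENGTH_BP)
print(computed_protein_length == PROTEIN_LENGTH_AA)
```

Under these assumptions the coordinates give 1437 bp, and 1437 / 3 = 479 codons, one of which is the stop codon, leaving 478 residues.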


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0992508
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATCCCCC CTGTTGTCAT TCCCTTGTCT GCGGAAATGC ACCCCCAATT CGTGACCCAT 
CGACTGGCTT CACTCGATGC ACTGTGCGTA CTCGAAAGTG CTGTTCAACA CGCCCATCTG
GGCAGGTACT CGTATGTCAT GGCCGATCCC TGGTTCTTAA GCGATGAGTA CAACGTTCAA
CCCTCGACAG ATCCTTTGAG CCCCATCGCC AAGGTAATGA AGAATCTGCC CTTCGAGCGT
CAGCCGGAAC TCCCACCTTT TCAGGGTGGC TTTGCCGGAG TGCTTTCTTA TGAAGCTGGC
CGGGCATTTG ACCGCATGCC GTCAGCCAGA CATCGCGACT TTCTCATCCC GGATTTCTCG
ATGAGTTTTT ACGATGTCGT TTTTGCCTGG GATCATATTG AGAATCGAGG TTGGTTGATC
TCTCAAGGCT GGCCCGAAAT CAAACCGGCT GCTCGTGCAA CAAGAGCGAC AGCGCGGGCC
CGACAATTTA TGAATTTGCT TGAGCAGAAG GTCACTGATC TAACTCCCCA GATTCCTCCA
AAGGGAGACA TAAGCTATCG CTCTTTGGAA AGCCTCTCTG CACCGTCGTT TGCATTAGCT
GGACACGACA ACATCTGGAG TAACTTTCGC AGGGAGGATT ATCTTCAGGC TGTAGATCGT
GTGATCGAGT ACATCCGGGC CGGCGACATC TTTCAGGCCA ACCTGACCCA GCGTCTATTG
ACTCCACAAA CCATACCCAG CCTGCAGATC TGGGAGAGGC TCCGTCAGCA CAATCCCTCG
CCATTCATGG CATTCTGGCA AAGGCCCAAA TGGTCGTTAT TGAGTGCTTC TCCTGAACGA
TTTCTCAAGA TCGAAAAGCA AAGCGTTGAG ACACGCCCCA TCAAGGGGAC ACGCCGGAGG
CAAGGTGCTG AAGCGGATTT ATTTCTACGC GACGAATTGC GAGAAAGCCG CAAAGATGTA
GCCGAGAATG TGATGATTGT GGATCTGCTG AGGAACGATC TCTCACGGGT TTGTCGTGCT
GGAAGTGTGA ACGTTCCTGC ATTATGCGAA GTGGAAACCT ATCGGACAGT TCAACATCTC
GTGAGTGTTG TCACGGGAGA ACTCTCCGAG CCTTTTGACT CCTGGGACGC CTTGCGGGCC
TGTTTTCCCG GCGGTTCGAT CACGGGGGCA CCGAAAGTTC GAGCGACGGA AATCATTGCT
GAACTGGAAC CCACCACACG AGGCCCTTAC ACAGGAACAT TGTTCTACAT GACCCCCGAT
CTCTGGTGCG ACAGCAGTAT TCTCATTCGC ACATTCATCG CGAGCGAGGG CTGGCTGCAA
TTGGGAGTCG GGGGTGGCAT TGTCGTCAAT TCCGACCCGG TTCAAGAGTT CGAGGAGACA
TTAACCAAAG CCTCCGGAAT GCTTGCGGCT CTGACTCGCC CCCATTCGAG CGAGTAA
 
Protein sequence
MIPPVVIPLS AEMHPQFVTH RLASLDALCV LESAVQHAHL GRYSYVMADP WFLSDEYNVQ 
PSTDPLSPIA KVMKNLPFER QPELPPFQGG FAGVLSYEAG RAFDRMPSAR HRDFLIPDFS
MSFYDVVFAW DHIENRGWLI SQGWPEIKPA ARATRATARA RQFMNLLEQK VTDLTPQIPP
KGDISYRSLE SLSAPSFALA GHDNIWSNFR REDYLQAVDR VIEYIRAGDI FQANLTQRLL
TPQTIPSLQI WERLRQHNPS PFMAFWQRPK WSLLSASPER FLKIEKQSVE TRPIKGTRRR
QGAEADLFLR DELRESRKDV AENVMIVDLL RNDLSRVCRA GSVNVPALCE VETYRTVQHL
VSVVTGELSE PFDSWDALRA CFPGGSITGA PKVRATEIIA ELEPTTRGPY TGTLFYMTPD
LWCDSSILIR TFIASEGWLQ LGVGGGIVVN SDPVQEFEET LTKASGMLAA LTRPHSSE
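
The gene and protein sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that layout, compute a GC fraction, and translate DNA to protein. It assumes that translation table 11 assigns the same amino acid to each codon as the standard genetic code (the two differ in which codons may act as start codons), and all function names are hypothetical helpers rather than an existing library API.

```python
# Codon table built in the conventional T, C, A, G order.
# Assumption: table 11 uses the same codon-to-amino-acid assignments as the
# standard code; alternative start codons are not handled specially here.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean_sequence(dna)
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon; '*' marks a stop, 'X' an unrecognised codon."""
    dna = clean_sequence(dna)
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(
        CODON_TABLE.get(dna[i:i + 3], "X") for i in range(0, usable, 3)
    )


# First 18 bases of the gene sequence, with the block spacing as printed.
first_bases = "ATGATCCCCC CTGTTGTC"
print(translate(first_bases))  # MIPPVV
```

The translated fragment matches the first six residues of the protein sequence listed above. Pasting the full gene sequence into `gc_fraction` and `translate` in the same way would allow the GC content and protein sequence fields to be checked against the record.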