Gene Plim_2069 details

Gene Information

Locus tag: Plim_2069
Symbol:
ID: 9138772
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Planctomyces limnophilus DSM 3776
Kingdom: Bacteria
Replicon accession: NC_014148
Strand:
Start bp: 2681505
End bp: 2684501
Gene Length: 2997 bp
Protein Length: 998 aa
Translation table: 11
GC content: 59%
IMG OID:
Product: valyl-tRNA synthetase
Protein accession: YP_003630095
Protein GI: 296122317
COG category:
COG ID:
TIGRFAM ID:
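
The Start bp, End bp, Gene Length and Protein Length fields can be cross-checked against each other. The following is a minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon; the function names are hypothetical and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon encodes no residue


length = gene_length(2681505, 2684501)
print(length, protein_length(length))  # 2997 998
```

Both values agree with the Gene Length and Protein Length fields listed for Plim_2069.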


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACGACCG AAATTCCCAA GTCTTACGAG CCGCAATCGA TTGAGTCGCA TTGGATTTCT 
TTCTGGGAAT CCCACGGCTA CTACAATGCT GATCCCAGCG AGACCAAACC GCCGCATGTG
ATCATGATCC CCCTGCCGAA CGTGACTGGG GCGCTACACA TGGGGCATGC CCTTAACGGG
ACGTTGCAGG ATCTCATCAC CCGCTGGCGG CGGATGCAGG GGTATTCGGC ACTCTGGATG
CCGGGAACCG ATCATGCGGG GATCGCTACT CAGTCGATGG TCGAGAAGCG GATGCTCGAG
GAGGAAGGGC TCACTCGTCA CGATGTGGGC CGCGAGGCAC TCATCGAGCG CATCTGGAAA
TGGAAGGATC ACTACGAGGC CCGCATTCTC GGCCAGCTCA AGCGGCTGGG GGCGAGCTGC
GACTGGCGAC GGACGCGCTT CACGCTCGAC GAGGTCTGCT CGCAGGCCGT GCGGCGAACG
TTCTTCAAGA TGTTCCGCGA CGGCCTCATC TATCGTGGCC AGCGGCTCGT CAACTGGGAT
GCCTTTTTGC AGACCGCTGT CGCCGACGAC GAAGTCTATA CCGAAGACAT CGACGGGCAG
TTCTGGACGT TCAACTACCC CGTGGTTGAC GATCAGGGGG AACCAACCGG CCAGCGAATT
TCATTCTCGA CGACACGTCC GGAAACGATG CTGGGCGATA CGGCACTCTG CGTCCATCCC
ACTGATGAAC GCTATACGGC GCTGGTGGGG AAGCACGTCC GGCTGCCGCT CGTCGGCAGG
CTTGTGCCGA TCATTGCCGA TGGACTGCTG GCTGATAAAG AGTTGGGGAC GGGCTGTGTG
AAGGTGACTC CCGCGCATGA TCCCAACGAT TACGCCTGCG GTCTGCGCAA CAAGCTTCCG
ATGATCAACA TCCTGCGGCC CGATGGCACC ATCAATGAAG AAGGTGGCGA GTTCGCCGGG
CTGGATCGTC TCGAAGCGCG CAAGGCCGTG GTTGCCAAGA TGGAATCTCT GGGATTCTTT
GAGAAGGTCG AAGACCGCAA GATTCCCATG AAGTTCAGTG ATCGGTCGAA GACGCCCGTT
GAGCCGCTGA TGTCCGATCA GTGGTTCGTG AAGATGGACG ACCTCGCCCA GAAGGCGATC
GACGCCGTGA CCGATGGCCG CGTGAAATTT TTCCCCGAAC GGTACCAATC GAGCTATCTC
GACTGGCTTG GAGAGAAGCG CGACTGGTGC ATCAGCCGCC AGTTGTGGTG GGGGCACCGG
ATTCCCGTGT GGAGTAAGCA GTTCGCTAGC GCGGAGGAAG CCGAGTCGTA TCTTGCGACT
CTTCCCGACA GTTCGGCGGC TGGTGTTCTT TCCGCCAGCG ATCCCGCGAG CGTGCTGATT
TGCGTGGACG CTTCGGCAGC CGATGGAGAA CAGACCCAGA ATCAACTCGA AGCAGACGGA
TTTGAACAGG ATCCCGATGT CCTCGACACC TGGTTCAGCA GTGCACTTTG GCCGCATGCG
ACGCTGGGTT GGCCGAATGA AACGTCGAAT CCACCACTCA ATGGCCAGCC AGATACAAGC
GGCGACGGGA AGAATACGGT CCTCCCCTAC TACTATCCGG GCAGTGTGCT GATCACCTCG
CGGGATATCA TCACGCTCTG GGTCGCGCGA ATGGTGCTAG CAGGTCTCTA CAACCTGAAT
GACATCCCCT TCAAACATGT GCATATCCAT CCCAAAATTC AGGACGGATT TGGGCAAGGC
ATGTCCAAGA CCAAAGGGAA CGGTGTTGAT CCGCTGGAAC TGGTCGATCG TTACGGCTGC
GACGGAACTC GCTTCACGAT CGCCTCGTTT GCGGGTGAAA CGCAGGACGT GCGCTTGCCG
GTCAGCTACG AATGCCCCCA TTGCCAGAAC CTGATTCCCC AGGAGCAGAA GCACCTGAAG
TTTTCGCCCG GAAAGCCGAA GATCAAGTGC CCCAAGTGCA AGAAGGAATC GCAGTACGCC
TGCCCCTGGT ACACACCGGA TGAGGGCGAA CTGGTGGCGG GGATTGTTAT CGAACGCTTC
GAGTTTGGCC GCAATTTCTG CAACAAACTC TGGAACGCCG CCCGTTTCGC CATGCTGAAT
CTGGAGGGCT ACACACCTGC TGCTGTGGCC AAGAGTGAAC TGGCGATTGA AGACCAGTGG
ATCGTCAGTC GCCTGGCGAC GGTGACGAAC GAAGTGACCT CCCTGCTGGG TGTCTACAAG
TTCGATGCGG CAACGCGCGC CCTGCGGGAT TTCGTCTGGA ACGAGTTCTG CGACTGGTAC
CTCGAACTGA TCAAGTCGCG GTTGCGTGAC GAGACGACCA AGCCGGTCGC GCAGCGGGTG
CTCGTGCATG TTCTCGATCA GATCCTGCGG TTGTTGCACC CGTTCACGCC GTTCATCACG
GAAGAACTGT GGCATCGCCT CGCCGAGATC GCCCCTTCAC GCGGCTTGCC TGAACCCCAG
CCGGCGGAAG CCGCCTGCAT TATCGCGGCC TGGCCCGTAG TGAACGACGC CGACATCTCC
CCTCCCCTCG AACAGCGGTT CACCCGGTTG CAGGAGACAA TCGGTGCGAT TCGGAATATT
CGCGCGACCT ATGGCATCAG CCTCGGCCAA TCGATTGCGG TTCATCTGAA GTGCCGTGCT
GAAGCTGCCG CTGACTTTGA GGCCTTACGA GTCCAGATCC AGAATCTGGC CAAAGCGGAG
ATTGCGGCGA CTGGCCCCGA GGTGCAGCGT CCCCCTGCTT CTGCGAGTTT TGCACTCGTG
GGAGCCGAGG GCTTCGTGCC GCTGGAAGGC CTGATTGATA AGGCGGCAGA ACTGGCCAAG
CAGAAGAAAG AGGCCGAGAA GCTGCGCGGC TTCATCGCTT CGACCGAAAA GAAGCTGGGG
AACGCCAGCT TCGTCGACAA GGCCCCACCG GAAGTGGTGG CCGAAGTTCG CCAGACGTTG
GCGAATCAGA AGAGTCAACT GGCAAGTATT GAAGAGATTA TCCGTCAGTT GCAGTGA
 
Protein sequence
MTTEIPKSYE PQSIESHWIS FWESHGYYNA DPSETKPPHV IMIPLPNVTG ALHMGHALNG 
TLQDLITRWR RMQGYSALWM PGTDHAGIAT QSMVEKRMLE EEGLTRHDVG REALIERIWK
WKDHYEARIL GQLKRLGASC DWRRTRFTLD EVCSQAVRRT FFKMFRDGLI YRGQRLVNWD
AFLQTAVADD EVYTEDIDGQ FWTFNYPVVD DQGEPTGQRI SFSTTRPETM LGDTALCVHP
TDERYTALVG KHVRLPLVGR LVPIIADGLL ADKELGTGCV KVTPAHDPND YACGLRNKLP
MINILRPDGT INEEGGEFAG LDRLEARKAV VAKMESLGFF EKVEDRKIPM KFSDRSKTPV
EPLMSDQWFV KMDDLAQKAI DAVTDGRVKF FPERYQSSYL DWLGEKRDWC ISRQLWWGHR
IPVWSKQFAS AEEAESYLAT LPDSSAAGVL SASDPASVLI CVDASAADGE QTQNQLEADG
FEQDPDVLDT WFSSALWPHA TLGWPNETSN PPLNGQPDTS GDGKNTVLPY YYPGSVLITS
RDIITLWVAR MVLAGLYNLN DIPFKHVHIH PKIQDGFGQG MSKTKGNGVD PLELVDRYGC
DGTRFTIASF AGETQDVRLP VSYECPHCQN LIPQEQKHLK FSPGKPKIKC PKCKKESQYA
CPWYTPDEGE LVAGIVIERF EFGRNFCNKL WNAARFAMLN LEGYTPAAVA KSELAIEDQW
IVSRLATVTN EVTSLLGVYK FDAATRALRD FVWNEFCDWY LELIKSRLRD ETTKPVAQRV
LVHVLDQILR LLHPFTPFIT EELWHRLAEI APSRGLPEPQ PAEAACIIAA WPVVNDADIS
PPLEQRFTRL QETIGAIRNI RATYGISLGQ SIAVHLKCRA EAAADFEALR VQIQNLAKAE
IAATGPEVQR PPASASFALV GAEGFVPLEG LIDKAAELAK QKKEAEKLRG FIASTEKKLG
NASFVDKAPP EVVAEVRQTL ANQKSQLASI EEIIRQLQ
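
The gene and protein sequences above are related through translation table 11, and the GC content field is derived from the gene sequence. The sketch below shows one way to reproduce both from the 10-base blocks as printed. It assumes that table 11 uses the same codon-to-amino-acid assignments as the standard code (the tables differ in their permitted start codons, which does not matter here because this gene starts with ATG). The helper names `clean`, `translate` and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to print 10-base blocks."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate an in-frame DNA sequence, dropping one terminal stop codon."""
    dna = clean(seq)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))
    return protein[:-1] if protein.endswith("*") else protein


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


first_row = "ATGACGACCG AAATTCCCAA GTCTTACGAG CCGCAATCGA TTGAGTCGCA TTGGATTTCT"
print(translate(first_row))  # MTTEIPKSYEPQSIESHWIS
```

Passing the full gene sequence to `translate` should give the protein sequence listed above, and `gc_fraction` on the same input can be compared with the GC content field.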