Gene B21_00813 details

Gene Information

Locus tag: B21_00813
Symbol: yliA
ID: 8113244
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli BL21
Kingdom: Bacteria
Replicon accession: NC_012892
Strand:
Start bp: 850884
End bp: 852722
Gene Length: 1839 bp
Protein Length: 612 aa
Translation table: 11
GC content: 54%
IMG OID: 644847079
Product: hypothetical protein
Protein accession: YP_002998652
Protein GI: 251784348
COG category: [R] General function prediction only
COG ID: [COG1123] ATPase components of various ABC-type transport systems, contain duplicated ATPase
TIGRFAM ID: [TIGR02323] phosphonate C-P lyase system protein PhnK
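
The length fields above follow from the coordinates. A minimal sketch in Python, assuming Start bp and End bp are 1-based inclusive positions on the replicon and that the unspliced CDS ends in a stop codon; the function names are illustrative and not part of the record:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon does not encode a residue


length = gene_length(850884, 852722)
print(length, protein_length(length))  # prints: 1839 612
```

Both values agree with the Gene Length and Protein Length fields of this record.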


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.991571
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGCTGGCGG TTGAAAATCT GAATATTGCC TTTATGCAGG ACCAGCAGAA AATAGCTGCG 
GTCCGCAATC TCTCTTTTAG TCTGCAACGC GGTGAGACGC TGGCAATTGT TGGCGAATCC
GGCTCCGGTA AGTCAGTGAC TGCGTTGGCA TTGATGCGCC TGTTGGAACA GGCGGGCGGT
TTAGTACAGT GCGATAAAAT GCTGTTGCAG CGGCGCAGTC GCGAAGTGAT TGAACTTAGC
GAGCAGAACG CTGCACAAAT GCGCCATGTT CGCGGTGCGG ATATGGCGAT GATATTTCAG
GAGCCGATGA CATCGCTGAA CCCGGTATTT ACTGTGGGTG AACAGATTGC CGAATCAATT
CGTCTGCATC AGAACGCCAG TCGTGAAGAA GCGATGGTCG AGGCGAAGCG GATGCTGGAT
CAGGTACGCA TTCCTGAGGC ACAAACCATT CTTTCACGTT ATCCGCATCA ACTCTCTGGC
GGGATGCGCC AGCGAGTGAT GATTGCGATG GCGCTGTCAT GCCGCCCGGC GGTGCTGATT
GCCGATGAGC CAACCACCGC GCTGGATGTC ACTATTCAGG CGCAGATCCT GCAATTAATC
AAAGTATTGC AAAAAGAGAT GTCGATGGGC GTTATCTTTA TCACTCACGA TATGGGCGTG
GTGGCAGAGA TTGCCGATCG GGTACTGGTG ATGTATCAGG GCGAGGCGGT GGAAACGGGT
ACCGTCGAAC AGATTTTTCA TGCACCGCAA CATCCTTACA CCCGTGCGCT GTTAGCTGCT
GTTCCGCAAC TTGGTGCGAT GAAAGGGTTA GATTATCCCC GACGTTTCCC GTTGATATCG
CTTGAACATC CAGCGAAACA GGCCCCCCCC ATCGAGCAGA AAACGGTGGT GGATGGCGAA
CCTGTTTTAC GAGTGCGTAA TCTTGTCACC CGTTTCCCTT TGCGCAGCGG TTTGTTGAAT
CGCGTAACGC GGGAAGTGCA TGCCGTTGAG AAAGTCAGTT TTGATCTCTG GCCTGGCGAA
ACGCTATCGC TGGTGGGCGA GTCTGGCAGC GGTAAATCCA CTACCGGGCG GGCGTTGCTG
CGCCTGGTCG AATCGCAGGG CGGCGAAATT ATCTTTAACG GTCAGCGAAT CGATACCTTG
TCACCCGGCA AACTTCAGGC ATTACGCCGG GATATTCAGT TTATTTTTCA GGACCCTTAC
GCTTCGCTGG ACCCACGTCA GACCATCGGT GATTCGATTA TCGAACCGCT GCGTGTACAC
GGTTTATTGC CAGGTAAAGA CGCGGCTGCA CGCGTTGCGT GGTTGCTGGA GCGCGTGGGC
CTGTTACCTG AACATGCCTG GCGTTACCCG CATGAGTTTT CCGGCGGTCA GCGCCAGCGC
ATCTGCATTG CTCGCGCGTT GGCATTGAAT CCAAAAGTGA TCATTGCCGA CGAAGCCGTT
TCGGCGCTGG ATGTTTCTAT TCGCGGGCAG ATTATCAACT TGTTGCTCGA TCTCCAGCGT
GATTTCGGCA TTGCGTATCT GTTTATCTCC CACGATATGG CGGTGGTAGA GCGGATTAGT
CATCGTGTGG CGGTGATGTA TCTCGGGCAA ATTGTTGAAA TTGGTCCACG GCGCGCGGTC
TTCGAAAACC CGCAGCATCC TTATACGCGT AAATTACTGG CGGCAGTTCC GGTCGCTGAA
CCGTCCCGAC AACGACCGCA GCGTGTACTG CTGTCGGACG ATCTTCCCAG CAATATTCAT
CTGCGTGGCG AAGAGGTGGC AGCCGTCTCG TTGCAATGCG TCGGGCCGGG GCATTACGTC
GCACAACCAC AATCAGAATA CGCATTCATG CGTAGATAA
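
The GC content field can be recomputed from the gene sequence. A minimal sketch in Python, assuming GC content is the fraction of G and C bases over the full gene length and that the spaces between 10-base blocks are formatting only; `gc_content` is an illustrative name, not part of the record:

```python
def gc_content(dna: str) -> float:
    """Fraction of G and C bases in a DNA string, ignoring whitespace."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# Toy input for illustration; pass the full gene sequence above
# to compare against the GC content field.
print(f"{gc_content('GGCC AATT'):.0%}")  # prints: 50%
```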
 
Protein sequence
MLAVENLNIA FMQDQQKIAA VRNLSFSLQR GETLAIVGES GSGKSVTALA LMRLLEQAGG 
LVQCDKMLLQ RRSREVIELS EQNAAQMRHV RGADMAMIFQ EPMTSLNPVF TVGEQIAESI
RLHQNASREE AMVEAKRMLD QVRIPEAQTI LSRYPHQLSG GMRQRVMIAM ALSCRPAVLI
ADEPTTALDV TIQAQILQLI KVLQKEMSMG VIFITHDMGV VAEIADRVLV MYQGEAVETG
TVEQIFHAPQ HPYTRALLAA VPQLGAMKGL DYPRRFPLIS LEHPAKQAPP IEQKTVVDGE
PVLRVRNLVT RFPLRSGLLN RVTREVHAVE KVSFDLWPGE TLSLVGESGS GKSTTGRALL
RLVESQGGEI IFNGQRIDTL SPGKLQALRR DIQFIFQDPY ASLDPRQTIG DSIIEPLRVH
GLLPGKDAAA RVAWLLERVG LLPEHAWRYP HEFSGGQRQR ICIARALALN PKVIIADEAV
SALDVSIRGQ IINLLLDLQR DFGIAYLFIS HDMAVVERIS HRVAVMYLGQ IVEIGPRRAV
FENPQHPYTR KLLAAVPVAE PSRQRPQRVL LSDDLPSNIH LRGEEVAAVS LQCVGPGHYV
AQPQSEYAFM RR
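
The protein sequence can be derived from the gene sequence with translation table 11. A minimal sketch in Python, assuming the gene sequence is given 5' to 3' on the coding strand; table 11 uses the standard amino acid assignments and additionally accepts GTG, TTG, CTG, ATT, ATC and ATA as initiator codons, which is why the leading GTG here yields M. The function name is illustrative:

```python
from itertools import product

# Standard codon assignments (shared by translation tables 1 and 11),
# listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiator codons accepted by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiator codon is read as methionine
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


first_line = "GTGCTGGCGG TTGAAAATCT GAATATTGCC TTTATGCAGG ACCAGCAGAA AATAGCTGCG"
print(translate_cds(first_line))  # prints: MLAVENLNIAFMQDQQKIAA
```

The output matches the first 20 residues of the protein sequence above; the final nine bases of the gene (CGT AGA TAA) likewise give the closing RR followed by a stop.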