Gene Lcho_3031 details


Gene Information

Locus tag: Lcho_3031
Symbol:
ID: 6162468
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Leptothrix cholodnii SP-6
Kingdom: Bacteria
Replicon accession: NC_010524
Strand:
Start bp: 3348587
End bp: 3350452
Gene Length: 1866 bp
Protein Length: 621 aa
Translation table: 11
GC content: 73%
IMG OID: 641665806
Product: allophanate hydrolase
Protein accession: YP_001792056
Protein GI: 171059707
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0154] Asp-tRNA(Asn)/Glu-tRNA(Gln) amidotransferase A subunit and related amidases
TIGRFAM ID: [TIGR02713] allophanate hydrolase
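
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length; the function and variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 3348587
end_bp = 3350452


def gene_length(start, end):
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length(n_bp):
    """Residue count for a CDS of n_bp bases, assuming one terminal stop codon."""
    if n_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return n_bp // 3 - 1


gene_len = gene_length(start_bp, end_bp)
prot_len = protein_length(gene_len)
print(gene_len, prot_len)  # 1866 621
```

Both results agree with the Gene Length and Protein Length fields of the record.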


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value: 0.153598
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAATATC CCTTGTTGTA TACCAGTGAG TGCACCAAAA CCGTGACATC CACCACACAA 
GCCCCCCGTA CCCTGACGCA ATGGCAAGAG GCCTACCGGG CCGGAGCCGA GCCGGCGGAC
CTGCTGCCGG CGCTGCGACA TCGGTTGATC CGTGGTGACG ATCCGGCGGT GATCCGCTGG
GTGACGGGCG ACGAGTTGGC ACGCCGCCTC GGCCAGCTGG CCGAGGTGGC CGCGGCCCAT
GCCGACCGCG CCGCGCTGCT CAAGGTGCTG CCGCTGTTCG GCGTGCCGTT CGCGGTCAAG
GACAACATCG ACATCGCCGG CATCGAGACC ACCGCCGCCT GCCCGGCCTT TGCGCACGTG
GCCGGCCAGT CGGCCGAGGC CGTGCGCCGG CTCGAAGCAG CCGGTGCGGT GTGGATCGCC
AAGACCAACC TCGACCAGTT CGCCACCGGC CTGGTCGGCG CTCGCAGCCC CTACGGCCGG
CCAGCCAGCG TGTTCGACGC CGCGCGCATC AGCGGCGGAT CCAGTTCGGG CTCGGCGGTG
CTGACCGCGC GTGGCGACGT GGCTTTTGCG CTCGGCACCG ACACCGCCGG CTCGGGTCGC
GTGCCGGCGG GTTTCAACGA ACTGGTGGGC CTCAAGCCGA CGCCCGGGCG GGTCAGCACC
GCCGGCGTGC TGCCGGCCTG CCGCAGCCTC GATTGCGTGT CGGTGTTCGC CCACTCGGTC
GAGGACGCCG CCGTGGTGCT GTCGGTGATC GAAGGCGCCG ATGCGGCCGA TGCCTACAGC
CACTTCGCGC CCGGCCCGTC GACCTGGGCG CCGCGCCTGA AGGTGGGCGT GCCGCGCGTG
CCGTTCTTCT TCGGCGACGC GGGTTACGAG GCCGCGTGGT CCTGGGCCGT GGCGCAGATG
GCCGCGCTCG GCCACGAGAT CGTCGCGCTC GACTTCGCGC CGCTCGACGA AGTGGCGGCT
CTGCTCTACG ACGGCCCGTG GGTGGCCGAG CGCCATGCGG TCGTCGCGGC GCTGCTGAGC
GCGCAACCCG ATGCGCTCGA CGCCACCGTG CGCCGCGTCA TCACGCGCGC CGTCGGCATG
AGCGCCACCG ACGCCTTCCG CGGCCTCTAT CGCCTGCAGG ACCTGAAGGC GGCGGGGGAG
GCCACCTGGT CGCGCTGCGA CCTGCTGATG GTGCCGACCG CGCCCGGCCA TCCGCGTTTC
AGCGAACTCG ATGCCGACCC GGTGGGCGTC AACTCGCTGC TCGGCCGCTA CACCAACTTC
GTCAACCTGC TGGGCTGGTG CGCGCTGGCG CTGCCGGCCG GGCGCACCGC GGTTGGCCTG
CCGTTCGGCG TGACCTTCAT CGCGCCGGGC AATCACGACG CCGCACTGGC GCGTTTCGGC
CTGGGCTGGC AGGCGGCGCA GGGCGTTGCC TCACCGGCGG CGACGCCCGC TCTCTGGCCC
CAGTCCGAGC CCGAGATGGC GATCGCGGTG GTCGGCGCGC ATCTGTCCGG CCTGCCGCTG
AACTGGCAGC TGACCGAACG CGGCGCCACG CTGATCGAAG CCACCCGCAC CGCGCCGCGC
TACCGCCTGC ACGCGCTGCC CGGCACCGTG CCGCCCAAGC CCGGCATGGT GCGCGACAGC
CTGCGCGGCG GCTCGATCGC GCTCGAGGTC TGGCGCATGC CGCAGCGCGC GGTCGGCAGC
TTCCTGGCGC TGATCCCGCA GCCGCTCGGC CTGGGCTCGA TCGAGCTGGC CGACGGCCGC
TGGGTGCACG GTTTCGTCTG CGAGGCCGAA GCCACCGCGC AGGCAAGCGA CATCACCGAG
CTGGGCGGCT GGCGGGCCTA CCTGCAAGCC GTTGCCGCCG CCCTTCCTGT CCCCCCGAGG
AGTTGA
 
Protein sequence
MQYPLLYTSE CTKTVTSTTQ APRTLTQWQE AYRAGAEPAD LLPALRHRLI RGDDPAVIRW 
VTGDELARRL GQLAEVAAAH ADRAALLKVL PLFGVPFAVK DNIDIAGIET TAACPAFAHV
AGQSAEAVRR LEAAGAVWIA KTNLDQFATG LVGARSPYGR PASVFDAARI SGGSSSGSAV
LTARGDVAFA LGTDTAGSGR VPAGFNELVG LKPTPGRVST AGVLPACRSL DCVSVFAHSV
EDAAVVLSVI EGADAADAYS HFAPGPSTWA PRLKVGVPRV PFFFGDAGYE AAWSWAVAQM
AALGHEIVAL DFAPLDEVAA LLYDGPWVAE RHAVVAALLS AQPDALDATV RRVITRAVGM
SATDAFRGLY RLQDLKAAGE ATWSRCDLLM VPTAPGHPRF SELDADPVGV NSLLGRYTNF
VNLLGWCALA LPAGRTAVGL PFGVTFIAPG NHDAALARFG LGWQAAQGVA SPAATPALWP
QSEPEMAIAV VGAHLSGLPL NWQLTERGAT LIEATRTAPR YRLHALPGTV PPKPGMVRDS
LRGGSIALEV WRMPQRAVGS FLALIPQPLG LGSIELADGR WVHGFVCEAE ATAQASDITE
LGGWRAYLQA VAAALPVPPR S
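
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the amino acid assignments of the standard genetic code, which translation table 11 shares, and it uses only the first 30 bases and the final 6 bases of the gene sequence as copied from this record. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical, and `gc_percent` is one plausible way to compute a GC percentage, not necessarily how the database derived its figure.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


first_codons = "ATGCAATATC CCTTGTTGTA TACCAGTGAG"  # first 30 bases of the gene
last_codons = "AGTTGA"                             # final 6 bases of the gene

print(translate(first_codons))  # MQYPLLYTSE
print(translate(last_codons))   # S
```

The two outputs match the first ten residues and the final residue of the protein sequence above. Passing the full gene sequence to `translate` and `gc_percent` would extend the same check to the whole record.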