Gene Acid345_1685 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_1685 
Symbol 
ID4069353 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp2042549 
End bp2043949 
Gene Length1401 bp 
Protein Length466 aa 
Translation table11 
GC content59% 
IMG OID637983693 
Productamidase 
Protein accessionYP_590760 
Protein GI94968712 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0154] Asp-tRNAAsn/Glu-tRNAGln amidotransferase A subunit and related amidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.992424 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.16071 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCGGCC TAACAACCCG TTCAGCCACT GAGCTTCTGG AACTTCTTCG TAAAAAAAAG 
CTTTCACCGC TCGAACTCGT GGAAGAGCAC ATCCACCAGA TCGAGCGGCT GAATCCCAAG
CTCAACGCGC TGGTGGACTT CGATCCTGAG CGCGTCCGGG CCCAAGCCCG CAAGGTTTCT
GCGCACGAAG GCCCGCTCGC AGGGTTGCCA GTCACTGTGA AGTCTTCGAT TGCGGTCGCG
GGCCACAAAT GCGAACTCGG CAGTGGGTTC TACCGGAACA ACATTCCCTC GGAAGATGCG
ACGGTGGTGG CACGTATGCG CGCCGCGGGA GCCGTGATCC TCGGGACTAC CAATGCTCCC
GAACTGCTGA TGTCCTACGA GACGGCTAAC GATCTCTACG GTCGCACTTT GAATCCATGG
AACATCGAGT ACTCTGCGGG TGGATCCAGC GGGGGCGAAT CGGCGGCAAT CGCGGCAGGG
ATGTCCGCGG CCGGGCTGGG CAGCGATAGC GGTGGTTCGG TCCGTCAGCC AGCGCATGCT
ACGGGCATTT GCGCGCTCAA GCCGACGCCG GGGCGGATTC CTGCTACCGG CCACATTCCC
GCCTGTCTCG GTCCGTTCGC GACGCTTGGC GCGATAGGTC CGATGGCACG GACGATGCAA
GATGTGTCGT TGCTGTTTAG CGTCCTGTCG GGGCAGGACC TCGACGATCC TGCTTCCGCG
CCGGTGCCGT TGTGCACTCC ATCGATCACC GAACTCAAGC AAATTCCGAT TGGCTATTTT
GAGGATGATG GCATCGTTCC TGTCACTCCG GAGACGCGTT TCGCAATCCA GTCCGCAGTT
GATGCGCTAC GGCGCGCGGG ATTTCGGGTT GAACCATTTC GACCACGAAC TCTCGAAGCA
GCACGAAAGA TCTGGTGGAC GTTCTTTGTC CGCTGCGGCT TTGCCTTCGA CGAAGCGATT
ATTCAAGGCC GTTACGAAAA ACTAAGTCCA ACATTCAAAG ACTTTATGGC GACTGCGCAA
GCGGAGCCGC CACTCGAAAG CAAGGAATTA CTTTTTGCCT GGGCCGAAGG CGACATGATC
CGCGCGAAAA TGCTCGCTGA GATGCGCGAC TATCCCGTGT GGCTGTGCCC TGTTTGCGCC
ATTCCGGCTT TCCGGCATGA CGAGCGCGAA TGGATAGTGG AAGGGAAGAC CGTTCAATAT
CTTGACGCGA TGCGTTACAT GCAGTGGTTC AACACGTTCG GCGCTCCGGC AGCAGTCGTG
CCGGTGGGAG CCTCTCCCGA AGGTCTACCA ATCGGGGTAC AGATCGCAGC TCGACCTTAC
GCAGATGAGA TTGTGTTGGG AATCGCCGAG GTGATCGATC GTGAGTTTGG CTATCGAGTA
CCGCCAATTG CGGAAAGCTA G
 
Protein sequence
MSGLTTRSAT ELLELLRKKK LSPLELVEEH IHQIERLNPK LNALVDFDPE RVRAQARKVS 
AHEGPLAGLP VTVKSSIAVA GHKCELGSGF YRNNIPSEDA TVVARMRAAG AVILGTTNAP
ELLMSYETAN DLYGRTLNPW NIEYSAGGSS GGESAAIAAG MSAAGLGSDS GGSVRQPAHA
TGICALKPTP GRIPATGHIP ACLGPFATLG AIGPMARTMQ DVSLLFSVLS GQDLDDPASA
PVPLCTPSIT ELKQIPIGYF EDDGIVPVTP ETRFAIQSAV DALRRAGFRV EPFRPRTLEA
ARKIWWTFFV RCGFAFDEAI IQGRYEKLSP TFKDFMATAQ AEPPLESKEL LFAWAEGDMI
RAKMLAEMRD YPVWLCPVCA IPAFRHDERE WIVEGKTVQY LDAMRYMQWF NTFGAPAAVV
PVGASPEGLP IGVQIAARPY ADEIVLGIAE VIDREFGYRV PPIAES