Gene Acid345_0602 details


Gene Information

Locus tag: Acid345_0602
Symbol:
ID: 4069635
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 733007
End bp: 734215
Gene Length: 1209 bp
Protein Length: 402 aa
Translation table: 11
GC content: 60%
IMG OID: 637982607
Product: glycosylasparaginase
Protein accession: YP_589681
Protein GI: 94967633
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1446] Asparaginase
TIGRFAM ID:
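
The coordinates and lengths listed in this record are mutually consistent, and a short check makes the arithmetic explicit. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends with exactly one stop codon; both are assumptions, although they reproduce the listed values. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive, 1-based coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Number of amino acids, assuming one trailing stop codon is not translated."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


length_bp = gene_length(733007, 734215)
print(length_bp)                  # 1209
print(protein_length(length_bp))  # 402
```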


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.982987
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAAGTTTT CCCGCCGTGA ATTTATCGCG ACGTCTGCAA TTGGTTCGGC TTCTCTTGCA 
CTTGATCTGA ATGCGCAATC CAACAATGCT CCTAAGGCTG CCGGGCCGGG GAAGAACATC
CTGATTTGCT CGGCGAATGG TCACAATTAT CTCGATCGCG GACACGCCGT ACTGGAGAAG
GGCGGCGACA CGCTGGACGC GATCATGGAA GTGGTTCGCG GGCCGGAAGA AGATCCGGAA
GACGACAGCG TAGGTTACGG CGGGTTGCCG AACGAAGAAG GCGTGGTCGA ACTCGATTCA
TGTTGTATGC ATGGACCAAC GCGGCTCGCC GGATCTGTCG GTGGCGTACA CGACATCATG
CATGTGGCTC TGCTGGCGAA GACAGTGATG GAGCACACCG GGCACGTGAT GCTGGTTGGC
GAAGGCGCGA AGCGTTTTGC GGTCGCGCAC GGCTTCCCAA CGATGAACCT GCTCACTGAG
CACTCGCGCA AAGTCTGGCT GCTGTGGAAG GAGACAAATT CCAATCAGGA CTGGTGGGGG
CCAGGACCGG CGAGCCCGCA CTTCAAGTTC CCGACGAATG GAACGAAGTC GGAGGACCTG
AAAGAACGTA TCCGTGAGAT GGAAAAGCTG GCGGAGCAGA TCGGAATCGA GCCGGAGCGG
CGGATGGCGG CGATCCATCG CGTTCTGTAT CCACCGACGG GCACGATCAA CTGCTCGGCG
CTGAAGGCGA ACGGCGAGAT GAGCGGCGCC ACCACAACCA GCGGACTGGC GTGGAAGATC
CCTGGGCGCT GCGGTGATTC GCCGATCATC GGCGCGGGCT GCTACTGCGA CCAGGACGTG
GGTTCTGCGG GAGCGACGGG CAGCGGCGAA GAGAACATCA AGATCGCCGG CGCGCACACG
ATCGTGGAGA ACATGCGCCA TGGCATGTCG CCAAAAGAGG CGGGCATGGA TGCGCTGAAG
CGGATCGTGC GGAACTATAA CGGGGACATG GCGCGCCTGA AGTACGTGAG CATGAAGTTC
TACATCCTGC GCAAAGACGG CGAGCATGCC GGCGTTTCGA TGTGGAGCGG GACGAAAGAA
GCTCCGTCGA AGTTCGCGAT CCACGATGGG ACGGCGCGGT TCGAGAATGC GGCGTACCTG
TATGAAGGCG AGCCGCAGGA GTGGCCGCCG ATGCCGGAGT TGCAGACTTC GACGTATTCG
ACGCTTTAG
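
The GC content reported for this gene can be recomputed from the gene sequence above. The following is a minimal sketch, assuming GC content means the fraction of G and C bases over all bases in the sequence; the helper name `gc_content` is hypothetical. It removes the spaces and line breaks used in the 10-base block layout before counting.

```python
def gc_content(dna: str) -> float:
    """Fraction of G and C bases, ignoring whitespace in the formatted sequence."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two 10-base blocks of the gene sequence: 11 of 20 bases are G or C.
print(gc_content("GTGAAGTTTT CCCGCCGTGA"))
```

Passing the full gene sequence to the function and rounding to a whole percentage gives a value to compare against the figure listed in the record.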
 
Protein sequence
MKFSRREFIA TSAIGSASLA LDLNAQSNNA PKAAGPGKNI LICSANGHNY LDRGHAVLEK 
GGDTLDAIME VVRGPEEDPE DDSVGYGGLP NEEGVVELDS CCMHGPTRLA GSVGGVHDIM
HVALLAKTVM EHTGHVMLVG EGAKRFAVAH GFPTMNLLTE HSRKVWLLWK ETNSNQDWWG
PGPASPHFKF PTNGTKSEDL KERIREMEKL AEQIGIEPER RMAAIHRVLY PPTGTINCSA
LKANGEMSGA TTTSGLAWKI PGRCGDSPII GAGCYCDQDV GSAGATGSGE ENIKIAGAHT
IVENMRHGMS PKEAGMDALK RIVRNYNGDM ARLKYVSMKF YILRKDGEHA GVSMWSGTKE
APSKFAIHDG TARFENAAYL YEGEPQEWPP MPELQTSTYS TL
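
The record lists translation table 11, and the gene begins with GTG while the protein begins with M. A minimal sketch of that relationship follows. It assumes the codon assignments of the standard genetic code in NCBI's TCAG ordering, and it assumes the table 11 initiator set is ATG, GTG, TTG, CTG, ATT, ATC and ATA, with any of these read as methionine at the first position. The function and constant names are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Assumed initiator codons for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading an initiator codon at position 1 as M."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for index in range(0, len(seq), 3):
        codon = seq[index:index + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        if index == 0 and codon in START_CODONS:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate_cds("GTGAAGTTTT CCCGCCGTGA ATTTATCGCG"))  # MKFSRREFIA
```

The output matches the first ten residues of the protein sequence above, and the final codons ACG CTT TAG translate to the terminal TL followed by a stop.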