Gene Acid345_2283 details

Gene Information

Locus tag: Acid345_2283
Symbol:
ID: 4073277
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 2705889
End bp: 2706884
Gene Length: 996 bp
Protein Length: 331 aa
Translation table: 11
GC content: 58%
IMG OID: 637984299
Product: cyanophycinase and related exopeptidase-like
Protein accession: YP_591358
Protein GI: 94969310
COG category: [P] Inorganic ion transport and metabolism
              [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG4242] Cyanophycinase and related exopeptidases
TIGRFAM ID:
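
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon, which is consistent with the 996 bp and 331 aa values listed.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


length_bp = gene_length(2705889, 2706884)
print(length_bp, protein_length(length_bp))  # 996 331
```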


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTGGATAA TGGCGGTCAT GCATCGTACC CTGTTCACCC TCATCCTCCT GGCATTTTCC 
ATTTCCGCTT TCGCTGCGGA CCCTGCGTAC AAGTACTGGC GGATCGGCAG TCAGAGTGAC
GTCACCACCA AAACCACCGC TGGCTACGCC CTGATGGGTG GCGGCTCCGA CCAGGACCCG
GCCTTCAAAT TCCTTTGCGA CAAATCGGGC GGCGGCGACT TCGTCATCCT TACTGCCAGC
GGCGACAACG ATTACAACGA CTACATCCAG AAGATGTGCA AGCAGAACTC GGTCGCGACG
ATTAAGATCC CCAACGCCGA AGCGGCCAAC GATCCATTCG TGGCCGAAAC GATCCGCAAG
GCCGAAGCTC TCTTCATCTC CGGCGGCGAC CAGTCAAACT ACGTCAAGTA CTGGAAGCCG
AGCCCGATGC GGGCGGCGAT CCAGGACCTG ATTGATCGCG GGGTTCCTGT CGGCGGCACC
AGCGCCGGCT TGGCGATCCT GGGTGAATTC AGCTTCGCGG CGCTCAACGA CACGGCCTAT
TCCGAAAAGA CCTTGAAAAA CCCGTACGAC AACACGGTGA CGATCGACCG CGACTTCCTG
AAGATCAACC ACCTGGAGAA CACCATTACC GATACGCACT TCAAGAAGCG CGACCGGCTC
GGGCGCACCC TGGTCTTTCT GGCGCGCATT CTGCAAGACG GACAGGCCAA GGACATCCGC
TCGATTGCGC TCGATGAGAA GAGCGCCGCG CTGATGGAAC CAGACGGTAC GATGACTGTC
GTCGGGAAAG GCACGGGCGC GTACTTCTAT CACCCTACGA CCAAACCGGA GATATGCAAG
GAAGGCGCGC CACTGACATT CACGGGAATT GATGTGTATC ACGTGCCGAA CGACGGGACC
TTCAACGTTG TGTCGTGGAC GGGCAAAGGC GGAAGTGCAT ACACGTTGAA CGTGAAGGAC
GGGGTGATTT CGTCGAGTTC GGGATCGAAC TACTAG
 
Protein sequence
MWIMAVMHRT LFTLILLAFS ISAFAADPAY KYWRIGSQSD VTTKTTAGYA LMGGGSDQDP 
AFKFLCDKSG GGDFVILTAS GDNDYNDYIQ KMCKQNSVAT IKIPNAEAAN DPFVAETIRK
AEALFISGGD QSNYVKYWKP SPMRAAIQDL IDRGVPVGGT SAGLAILGEF SFAALNDTAY
SEKTLKNPYD NTVTIDRDFL KINHLENTIT DTHFKKRDRL GRTLVFLARI LQDGQAKDIR
SIALDEKSAA LMEPDGTMTV VGKGTGAYFY HPTTKPEICK EGAPLTFTGI DVYHVPNDGT
FNVVSWTGKG GSAYTLNVKD GVISSSSGSN Y
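
The gene and protein sequences above are related by translation table 11, whose codon-to-amino-acid assignments match the standard genetic code (the two tables differ only in which codons may act as starts). As a minimal sketch, not the database's own method, the hypothetical helpers below strip the 10-base block spacing, compute the GC fraction, and translate up to the first stop codon. They assume the input contains only the unambiguous bases A, C, G, and T.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases iterated as T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]  # KeyError on ambiguous bases such as N
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence as laid out in this record.
print(translate("ATGTGGATAA TGGCGGTCAT GCATCGTACC"))  # MWIMAVMHRT
```

Passing the full gene sequence to `translate` and `gc_content` gives values that can be compared with the protein sequence and the GC content field listed in this record.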