Gene Acid345_3388 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3388 
Symbol 
ID4072724 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp4009963 
End bp4010961 
Gene Length999 bp 
Protein Length332 aa 
Translation table11 
GC content60% 
IMG OID637985410 
Producthypothetical protein 
Protein accessionYP_592463 
Protein GI94970415 
COG category[R] General function prediction only 
COG ID[COG2842] Uncharacterized ATPase, putative transposase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCACTCA GTATGAAAGA TCGCGAACTT TTGCTGGCAA CTTCTTTGCC CGCCGCGAAC 
GCCGTACGCG AGCAGCTGAA CGAGTATCTG GCACGCACCG GCCTCGCTTA TTCCGATTTC
GCCAGGCGAA TCAACTACTC GTCGGTGACC CTCCGCTTCT TCATCAAGGG ACGCTACGCC
AACATCGCCT CCAACGATGC CCCTCTTCGC AAGGCCATTA CCGAATTCAT TGCTGCGCAC
CCAATCGAAC CCGTTACGCA GGCAGGCGAC AAGCTCTATG AGACGGAGAA CGTGCAGCTG
CTGCGCCAGT ACTTCTACGA GGCGCTCGAC GACTGCCGCA TGATCTACGT CCACGGCGCG
CCGGGATCGC AGAAGACGTT CGTGCTCGAA CACCTGACCG CGGAACTCAA CTGTGCCGAA
GTATCGAAGA ACGGTCACGG CCGCCGCGCC TATTACGTAT ATTGCCCACA GTCGGTGAAG
ACCTCGCAGA AGATCATGCG CGAGATCGCC GAGGCCTGCG GTGTCGATAC CACCGGCGAC
GCGCAGCGCA TCCTGAAGCG GCTGCGCTTC GAGTTTCGCA CCCGCAAGGT GATCTTCATC
CTCGACGAGG CACAGCACCT CAACTACGAG TGCCTCGAAA CCATCCGCGG GCTCTTCGAT
CGCATGCCGC ACTGCGCGAT CCTGCTCGCC GGTTCGCACC AGCTCGAAAC CACTTTTATG
CGCGATGCCG CGCGGCTCGA ACAGTGGAAC TCGCGGCTGC ACTTCGGCAA GGCTCTGCCG
GGAATCTCCG ACGACGAGGC CGACACGATC ATCCGCCAGG AACTCGGCGA GAAGGTGACG
TCGCCGATCG TTCGCAAGCT CATCACCGAA TCCAAGGCCC TCGATGTTCG CCGCTCCGGT
GAACACAACT ACATCTCAGC CCGGCGTCTG TTCTGGTCCA TCCGCGACAT CAAGCGCGCG
ATAGAGAAAC GCCAGGCCAA GAAAGAAGCC TCCGCATGA
 
Protein sequence
MALSMKDREL LLATSLPAAN AVREQLNEYL ARTGLAYSDF ARRINYSSVT LRFFIKGRYA 
NIASNDAPLR KAITEFIAAH PIEPVTQAGD KLYETENVQL LRQYFYEALD DCRMIYVHGA
PGSQKTFVLE HLTAELNCAE VSKNGHGRRA YYVYCPQSVK TSQKIMREIA EACGVDTTGD
AQRILKRLRF EFRTRKVIFI LDEAQHLNYE CLETIRGLFD RMPHCAILLA GSHQLETTFM
RDAARLEQWN SRLHFGKALP GISDDEADTI IRQELGEKVT SPIVRKLITE SKALDVRRSG
EHNYISARRL FWSIRDIKRA IEKRQAKKEA SA