Gene Acid345_1040 details


Gene Information

Locus tag: Acid345_1040
Symbol:
ID: 4073127
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 1306586
End bp: 1307599
Gene Length: 1014 bp
Protein Length: 337 aa
Translation table: 11
GC content: 58%
IMG OID: 637983047
Product: peptidase dimerisation
Protein accession: YP_590117
Protein GI: 94968069
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0624] Acetylornithine deacetylase/Succinyl-diaminopimelate desuccinylase and related deacylases
TIGRFAM ID:
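
The start, end, gene length, and protein length fields above are related by simple arithmetic. The sketch below shows one way to cross-check them. It assumes the coordinates are 1-based and inclusive and that the gene length includes the terminal stop codon; the function names are illustrative and do not come from the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues for an unspliced CDS whose length includes
    one terminal stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
length = gene_length_bp(1306586, 1307599)
print(length)                     # 1014
print(protein_length_aa(length))  # 337
```

Both results agree with the Gene Length and Protein Length fields in the record.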


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.108705
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTTTTG ACGTCGTCGC CCTCACCCGA AAACTAATTG ACGTGGAATC AATTACCGGG 
AACGAAGCAC CGGTGGGTGA GCTCCTGGTT CGCGAACTAT CGGCACTCGG CTACCAAGTC
TCGCGAATGC CGGTCGAAGA AGAACGTTTT AACGTCTGGG CAACCTCTCC CGGTCACCAA
CGTCCTAAGG TCGTCTTTTC GACGCACATG GACGTGGTTC CTCCATGGAT TCCCTCGTCC
GAGGACGAGA AGAACATCTA CGGCCGCGGA GCTTGCGATG CCAAGGGCAT CATCGCCGCG
CAAATCGACG CCGCTGAGAA GTTGCGCACC AAAGGCATTC ACGCCGGACT ACTGTTCGTC
GTCGGCGAAG AACGCGACAG CACCGGCGCC TACGTCGCCA ATTCACACGC GCCGGGTTCG
AAGTTCCTCA TCAACGGCGA GCCCACCGAC AATCGCATCG GTGTCGCCTC TAAAGGCGCG
CTGCGTGTGA ACGTAATTGC GGAAGGAAAA ATGGCGCACT CGGCCTATCC GGAGCTTGGA
GAATCCGCGA TCGAAAAGTT GCTGAATGCG CTTGAACGCC TACGCAAAAT GCCGCTTCCA
GAAAACCCTG AGGTCGGTCC ATGCACGGTA AACATCGGCG TGATCGAAGG CGGCCGCGCA
CCAAATGTCA TTCCCGACCA AGCCAGCGCC CAGCTGCTCT TTCGCCTGGT CGGCCCGTCT
GAACAACTGC GTAAAGACAT CGAAACCGCT ATCGCCCCCG ATGCCCACTG CGAATACGCG
CTTGAGATTC CTTTTGTGAA ACTGCGCACA GTTCCCGACA TTCCGACCAT GACGGCAAAG
TTCACCACTG ACATCCCGCG CTTGAGCAAC TGGGGCGAGC CCGTTCTGCT CGGCCCCGGC
TCGATCCATG TCGCACATAC TCCACGCGAG TTCCTGAGCA AGCAGGAACT GTTTGAGGCC
GTGGAGCTCT ATGTGAAAGT CGCCGAATTT TTCAACGCGC AACCTGGCGC GTAA
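
The gene sequence is printed in blocks of 10 bases, so it has to be joined before any computation. The record does not say how its GC content figure was calculated. The sketch below assumes the usual definition, the share of G and C among all bases, and uses illustrative helper names.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases (assumed definition of GC content)."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First 10-base block of the gene sequence above: 3 of 10 bases are G or C.
print(gc_percent("ATGTCTTTTG"))  # 30.0
```

Passing the full gene sequence, pasted as a single string, to `gc_percent` gives a value that can be compared with the GC content field of the record.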
 
Protein sequence
MSFDVVALTR KLIDVESITG NEAPVGELLV RELSALGYQV SRMPVEEERF NVWATSPGHQ 
RPKVVFSTHM DVVPPWIPSS EDEKNIYGRG ACDAKGIIAA QIDAAEKLRT KGIHAGLLFV
VGEERDSTGA YVANSHAPGS KFLINGEPTD NRIGVASKGA LRVNVIAEGK MAHSAYPELG
ESAIEKLLNA LERLRKMPLP ENPEVGPCTV NIGVIEGGRA PNVIPDQASA QLLFRLVGPS
EQLRKDIETA IAPDAHCEYA LEIPFVKLRT VPDIPTMTAK FTTDIPRLSN WGEPVLLGPG
SIHVAHTPRE FLSKQELFEA VELYVKVAEF FNAQPGA
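
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is a minimal translator, not the database's own tool. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code and differs only in the permitted start codons. The function name and the stop-handling behaviour are choices made for this example.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code, shared by table 11).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()  # drop the 10-base block spacing
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence above.
print(translate("ATGTCTTTTG ACGTCGTCGC CCTCACCCGA"))  # MSFDVVALTR
```

The output matches the first ten residues of the protein sequence. Translating the full gene sequence the same way allows the whole protein to be compared.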