Gene Acid345_1849 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_1849 
Symbol 
ID4069191 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp2227865 
End bp2229244 
Gene Length1380 bp 
Protein Length459 aa 
Translation table11 
GC content58% 
IMG OID637983858 
Productpeptidase M20 
Protein accessionYP_590924 
Protein GI94968876 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0624] Acetylornithine deacetylase/Succinyl-diaminopimelate desuccinylase and related deacylases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTTCAG CAGCGGTTGG GTACGCACGC GAAAATCAGT CGCGGTTCCT GGAAGAGCTC 
AAAGCACTAC TCCGCATTCC TAGCGTCAGC ACGGCAGAAG AGCACAAAGA TGACGTTCGC
AAGGCCGCCA ATTTCGTGGC CGAAGAACTG AAGCGTATTG GCTTCGAGAA CGTGCAGGTA
ATCGAGACCA AGGGGCATCC GCTGGTGTAC GGCGACTGGC TCCATGCCGA GGGCAAGCCG
ACGGCGCTGT GCTATGCGCA CTACGACGTG CAGCCGGCCG AGCCGCTGGA TGAGTGGCAT
ACGCCGCCGT TTGAGCCCAC CGAGCGCAAC AGCAATCTCT ACGCGCGCGG CGCCGTGGAC
GACAAAGGCC AACTATGGAT GGAAGTGAAG GCGTTCGAGT CGCTCTTCCA GACTCACGGC
GGCAAACTGC CGATCAACGC TCGCGTGCTC TTCGAAGGTG AAGAAGAAGT CGGCGGCGAA
GCGATCGAAG AATACGTCAA GACCAATCCC GACAAGTTGA AGGCCGACTT CGCGCTCATC
TGCGATACCG AGCTGTTTGC GCCGGACCTT CCAACGATTT GCGTGGGTTT GCGCGGACTG
GTTTACACCG AGATTGAGGC CCGCGGCGCG GCGACCGATC TGCACTCCGG TGTTTATGGT
GGCGCAGCGC CGAACCCGCT GTTTGCGCTC TCGACCATCA TTGCGAAGCT GAAGGACGCC
AACGGGAAGA TCCTGATTCC GGGCTTTTAT GAAGGTGTGA AGAAGCCGAC GGCTGCGGAA
CTGAAGGCGT GGCATTCGCT GCCGTTCAAT GAAGAAGAGT ATCGGCAGAA GGAAGTTGGT
TCGGATGTAC TGACAGGCGA ACCGAAGTAT CCGGTGCTGT ATCGTACGTG GGCGCGGCCG
ACGCTGGAAG TGCACGGCAT GCCGGGCGGC TTTGTTGCAC AGGGCGCGAA GACGGTGATC
CCGGCGAAGG CGTCGGCGAA AATCTCGATG CGACTGGTGC CAAACCAGGA CCCCGACGAC
ATCCTAAAGA AGTACACCGA GTACGTGACG TCGCTGACTC CGAAGGGCAT CCAGTTGAAG
TTCAAGGTGC ACAGCAAGGG CGCCGCGATC GTGGTTGGCA CCAAGAACAA GTACATCAAG
GCTGCCACGC ACGCACTGCA TGAGATCTTC CACAAAGACA CCGTATACAC GCGGTCGGGC
GGATCGATTC CAATTGTGGC GCAGTTCGCG AATGACTTGA AGATTCCCTC GGTGATGATG
GGCTTTGGGT TGCCGGATGA CAATCTGCAC GCGCCGAATG AGAAGTTCCA CATTCCAAAC
TTCCATCGCG GGATCGAATC GTTGGCCCGC TTCTTTGAGA TTTTGGGCAG CGAGAAGTAA
 
Protein sequence
MASAAVGYAR ENQSRFLEEL KALLRIPSVS TAEEHKDDVR KAANFVAEEL KRIGFENVQV 
IETKGHPLVY GDWLHAEGKP TALCYAHYDV QPAEPLDEWH TPPFEPTERN SNLYARGAVD
DKGQLWMEVK AFESLFQTHG GKLPINARVL FEGEEEVGGE AIEEYVKTNP DKLKADFALI
CDTELFAPDL PTICVGLRGL VYTEIEARGA ATDLHSGVYG GAAPNPLFAL STIIAKLKDA
NGKILIPGFY EGVKKPTAAE LKAWHSLPFN EEEYRQKEVG SDVLTGEPKY PVLYRTWARP
TLEVHGMPGG FVAQGAKTVI PAKASAKISM RLVPNQDPDD ILKKYTEYVT SLTPKGIQLK
FKVHSKGAAI VVGTKNKYIK AATHALHEIF HKDTVYTRSG GSIPIVAQFA NDLKIPSVMM
GFGLPDDNLH APNEKFHIPN FHRGIESLAR FFEILGSEK