Gene Acid345_3656 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3656 
Symbol 
ID4072259 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp4326083 
End bp4327198 
Gene Length1116 bp 
Protein Length371 aa 
Translation table11 
GC content59% 
IMG OID637985679 
Productaminopeptidase DmpA 
Protein accessionYP_592731 
Protein GI94970683 
COG category[E] Amino acid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3191] L-aminopeptidase/D-esterase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.950122 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGCCC ACATCTTCTC TGCTCTCCTG CTAGCGTCAT CGCTGGTCTG CGCCCAGAAA 
CCTCGCGCCC GCGATCTTGG CGTTCCTTTC GACGGAACTC CTGGCAAGTT CAATGCGATC
ACCGATGTCG CCGGCGTAAC AGTAGGCCAC AAGACGCTGA TCGAAGGCAC CGACATTCGC
ACTGGCGTGA CCGCGGTGAT TCCGCGCGCG AACGATTTGT TTGATCCGGT GTATGCGGGA
TGGTTTTCGC AGAACGGTAA TGGCGAGATG ACGGGAACGA CCTGGGTGGA AGAGTCGGGA
TTCCTTGATG GCCCGGTAGT GATCACCAAC ACGCACAGTG TTGGTGTGGT GCGCGATGCG
GTGATCGCAT GGCGCCTGAA CCACCAGCCG CCGAAAACGG TGGAGGATGC GTGGTCGCTG
CCGGTTGTCG CGGAAACGTG GGACGGCTGG CTGAACGATA TCAACGGATT CCACGTGAAG
CCGCAAGACG CGTTCGATGC GCTCGACAGT GCGAAGTCAG GCCCGGTGCT GGAAGGCGCT
GTCGGCGGCG GAACGGGGAT GATTTGTAAC GAGTTCAAGG GCGGGATTGG GACGTCTTCG
CGCATGCTCG ACGCCAAGGC CGGAGGCTAT ACGGTTGGGG TGCTGGTGCA GTGTAATTAC
GGGCTGCGTC CGCAGCTGCG CATTGCTGGT GTTCCGGTGG GTAAGGAGAT TCCACAGCAC
GCTGCCTATG AAGAAGAGAA AGGCTCGATC ATCGTCGTGG TTGCGACCGA TGCTCCGCTG
ATTCCGACGC AACTCAAACG ACTTGCGCGC CGCGTAACCG ATGGGCTGGG GCGGCTGGGA
AGCATCTCGG GCAATGGGTC GGGAGATATC TTCATTGCCT TCTCCACGGC GAATCCGCAC
ACCTACTACA GCGAGAAGGC AGCGACGATA CAGACGCTGC CGCTCGACAA CATGGATCCG
TTATTTGCAG CGACCGTGCA GGCGACGGAA GAAGCAGTGG TGAATGCGCT CGTCGCCGCA
GAAGACATGA CGGGGTACAA GGGGCGGAAG GTGATCGCGC TGCCGCACGA TCAATTGCGA
GAAGTGTTGA AGAAGTACAA CCGGCTGGAA AAGTAA
 
Protein sequence
MKAHIFSALL LASSLVCAQK PRARDLGVPF DGTPGKFNAI TDVAGVTVGH KTLIEGTDIR 
TGVTAVIPRA NDLFDPVYAG WFSQNGNGEM TGTTWVEESG FLDGPVVITN THSVGVVRDA
VIAWRLNHQP PKTVEDAWSL PVVAETWDGW LNDINGFHVK PQDAFDALDS AKSGPVLEGA
VGGGTGMICN EFKGGIGTSS RMLDAKAGGY TVGVLVQCNY GLRPQLRIAG VPVGKEIPQH
AAYEEEKGSI IVVVATDAPL IPTQLKRLAR RVTDGLGRLG SISGNGSGDI FIAFSTANPH
TYYSEKAATI QTLPLDNMDP LFAATVQATE EAVVNALVAA EDMTGYKGRK VIALPHDQLR
EVLKKYNRLE K