Gene Acid345_3212 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3212 
Symbol 
ID4070424 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp3799721 
End bp3800977 
Gene Length1257 bp 
Protein Length418 aa 
Translation table11 
GC content60% 
IMG OID637985233 
Productaminopeptidase T 
Protein accessionYP_592287 
Protein GI94970239 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2309] Leucyl aminopeptidase (aminopeptidase T) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.713242 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATAATT CCGCCGTTGC TGCTACCGCG CTCACCTTCG AACAGAAACT CGACCAGCTT 
GCTGAAGTCG CTATCCGCAT TGGATTGGGA CTTGCTCCCG GCCAGGAACT CCTGATGACT
GCGCCGCTCG ATGCACTGCC GCTGGCGCGG CGCATTACCG AGCAGGCGTA CAAGGCGGGT
GCATCGCTCG TCACTACGCT CTACAGCGAC GACGAAGCAG TGCTGGCGCG CTATCACCAT
GCGCCCAACG AAGCGTTCGA TAAAGCACCC AAGTGGCTCT ACGACGGCAT GGCCGCTGCG
TTCAAGAGCG GCGCTGCTCG GCTGGCGATC GCCGGCGCGA ACCCGATGCT GCTCTCAAAG
GAAGATCCCG ATAAAGTAGG GCGCTCGAAC CGCGCGGTTT CTGCAGCTTC GAAGCCCGCG
ATGGAGTTGA TCACGCGACA TGAAATCAAC TGGACGATCG TTGCGGCGGC GACTCCATCC
TGGGCCGCGA CGATGTTCCC GAACGATTCC GCCGATGTAG CCATCAACAA GCTCTGGGAC
GCGATTTTCG CGACCTCGCG CGTGGGCGGC GACGATCCCG TCAGTTTGTG GAAGAAGCAC
GACGACGGAC TCCAGAAACG CGCTGCCTAT ATGAATGAGA AGCGCTACGC GGCGCTGCAG
TATCGCGGGC CGGGGACTGA TTTCCGGCTT GGCTTGTCGG ACGGCCATCT TTGGATGGGT
GGCGGAACTA CGGCAGGGAA CGGACTGTAC TGCATTCCGA ATATCCCGAC GGAAGAGATT
TTCACCACGC CGCACAAAGA TCGCGCTGAT GGCACGGTCA CTGCGAGCAA GCCGCTCTCG
CACATGGGAA CGCTGATCGA AGACATTCAC GTTCGCTTCG AAGGCGGCCG CATTGTGGAA
GCGAGAGCCT CGCGTGGGCA AGAAGTGCTG CAGAAACTCA TTGACACAGA TGACGGCGCG
CGTCGCCTCG GAGAAGTTGC TCTGGTTCCA CACTCCTCGC CGATCGCCAG CAGCGGCATT
TTGTTTTACA ACACGCTGTT CGACGAGAAT GCTGCGTCAC ATATCGCGCT CGGCCAGGCG
TACACCTCGT GCTTGATTGA CGGCGATAAG GCATCGGCAG AAGAACTCGC ACAGCGCGGC
GCGAACTCGA GTTTGATCCA CGTGGACTGG ATGATCGGCT CGAACAAGCT CGATATCGAT
GGCATTACCG CGGACGGGAC GGCGGAGCCG GTGATGCGTC AGGGCGAGTG GGTGTAG
 
Protein sequence
MNNSAVAATA LTFEQKLDQL AEVAIRIGLG LAPGQELLMT APLDALPLAR RITEQAYKAG 
ASLVTTLYSD DEAVLARYHH APNEAFDKAP KWLYDGMAAA FKSGAARLAI AGANPMLLSK
EDPDKVGRSN RAVSAASKPA MELITRHEIN WTIVAAATPS WAATMFPNDS ADVAINKLWD
AIFATSRVGG DDPVSLWKKH DDGLQKRAAY MNEKRYAALQ YRGPGTDFRL GLSDGHLWMG
GGTTAGNGLY CIPNIPTEEI FTTPHKDRAD GTVTASKPLS HMGTLIEDIH VRFEGGRIVE
ARASRGQEVL QKLIDTDDGA RRLGEVALVP HSSPIASSGI LFYNTLFDEN AASHIALGQA
YTSCLIDGDK ASAEELAQRG ANSSLIHVDW MIGSNKLDID GITADGTAEP VMRQGEWV