Gene Acid345_3224 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3224 
Symbol 
ID4072559 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp3817505 
End bp3818839 
Gene Length1335 bp 
Protein Length444 aa 
Translation table11 
GC content61% 
IMG OID637985245 
Productaminopeptidase P 
Protein accessionYP_592299 
Protein GI94970251 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0006] Xaa-Pro aminopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.638198 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0560599 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTCGTA AGCTGCTGGT CCTCGCTCTG CTCCTTGCGC CGTTCTCCGG CGCCATGGAA 
CGCCAAAATA ACGCCGACTA CCGTGCTCGG CGCCAGAAAC TTGCCGCCGA ATTGAAGGGC
GGTGTTCTGG TGCTCTTCGC ACCCACCGAG CCCTCCGCCG GAAACGCCAC CAGCGGCTTT
CGCCAGGACG ATAACTTCTA TTACCTCACC GGCTGGTCGG AGCCCGGCGC CGCGATCATG
ATCGCCGCCG AAGTCGTGGC GAAAGATGAG CATCCCGCGC GTGCCTATAC GGAAGTGCTC
TACCTTCCGG CGCACAACAC CGTGCAGGAA AAGTGGACTG GCCCGAAACT CGGCCCTGAG
AACCCGCAAG CCCGCGACCT CACCGGATTC GACCGCGTCG AACTGCTCGA CAAAATGCGC
GACGACATCG CCGACCTTCT CCAAAAGGAT CCTCGTGCGC CGATCTATTC CGACATCTCG
ACTGGCGACG AAGTCTCGCC TTCCGCCGAC GGACTAGCCT GGCTGAAACG CGCCAACGCG
TTTCCCGTCG TCCGCTTCGC CGACTTCAAG CCGATCGTCA GCGACCAGCG CCGTGTTAAG
GACGCTGGCG AAATCGAGTT GATCCGCAAA GGTACGAATG CCTCCATCGC TGGCCATTTG
GCCGCATTCA AAGCCATACA TCCCGGCGTA ACCGAGCGCG AAATCGCCGC GCTGCAGATG
TACGAGTTCG GCAAGCGCGG CTGCGAGCGA CCGGCCTACG CGCCCATCGT CGGCTCCGGC
TACAACGGCA CCGTGCTGCA CTACTCCGAA GATTCCGGCA CGCTGAAAGA TGGCGACCTC
GTCGTCATGG ACGTAGCTGG CGAATACAGC ATGTACGCCT CCGACATCAC CCGTACGGCT
CCGGTCAACG GCCATTTCAC GGCCCGCCAG CGCGAGATCT ATGAAATCGT CCTTGGCGCA
CAGCGCGCGG CCATCGAAGC ATTCGTCTCA GGCAAGTCTG TGCTGCTCGG CAAGACTGAC
GACTCGCTCT ACAAAGTCGC CTACGACTAC ATCAATACCC ACGGCAAAGA CCTGCACGGC
GAGCCGTTAG GCAAGTACTT CATCCACGGC CTCGGCCACT ACGTTGGGCT TGAGGTCCAC
GACCCCGGTT CCTACGCCAC GCCGTTGCAG CCAGGCATGG TCTTCACCAT CGAACCGGGT
GTTTATATCC CCGAAGAGAA GCTCGGCGTA CGCATCGAAG ATATTGTGTA CGTTGACGCC
AACGGCAAAC TCGTGGACTA CACCGCCGCG CTCCCGCACA CCGTCGAAGA AGTCGAAAAG
GCAATGAAGA AATAG
 
Protein sequence
MIRKLLVLAL LLAPFSGAME RQNNADYRAR RQKLAAELKG GVLVLFAPTE PSAGNATSGF 
RQDDNFYYLT GWSEPGAAIM IAAEVVAKDE HPARAYTEVL YLPAHNTVQE KWTGPKLGPE
NPQARDLTGF DRVELLDKMR DDIADLLQKD PRAPIYSDIS TGDEVSPSAD GLAWLKRANA
FPVVRFADFK PIVSDQRRVK DAGEIELIRK GTNASIAGHL AAFKAIHPGV TEREIAALQM
YEFGKRGCER PAYAPIVGSG YNGTVLHYSE DSGTLKDGDL VVMDVAGEYS MYASDITRTA
PVNGHFTARQ REIYEIVLGA QRAAIEAFVS GKSVLLGKTD DSLYKVAYDY INTHGKDLHG
EPLGKYFIHG LGHYVGLEVH DPGSYATPLQ PGMVFTIEPG VYIPEEKLGV RIEDIVYVDA
NGKLVDYTAA LPHTVEEVEK AMKK