Gene Cla_0231 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCla_0231 
SymbolpepQ 
ID7410954 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCampylobacter lari RM2100 
KingdomBacteria 
Replicon accessionNC_012039 
Strand
Start bp204801 
End bp205826 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content32% 
IMG OID643717366 
Productprolidase (Xaa-Pro dipeptidase) 
Protein accessionYP_002574845 
Protein GI222823272 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0006] Xaa-Pro aminopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones54 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATTTCA TCTTAAAAAA CGAAAATGCA CTTTTTTATG AGTGTGGCTA TTCTTGCGAT 
AATGCTTTAT TTTTAAAACT TGAAGATGAA GCATTTTTCA TCACTGATGC AAGATATAGC
TTTGAAGCTA GTGAAATGAT AAAAAATGCT AAGGTGGTTT TAGCACAAGA TCTTTTTGCT
AGTGCTAGAG AGCTTTTAGA AAAAATGGGA ATTGATAGGG TGTGTTTTGA CCCAAAAGAC
TTTAGCTATT TTGAATTTAA AGAACTTAGT AAAAGTGCAA ATATCGTTTT TGAAGAAAGA
TTAGATTTTA GTAAAAACAA ACGCATTATA AAAAATTCTA AAGAATTACA ACTTTTGCAA
AAGGCTGTAA ATTTTGGTAA AGAATGCTTT GATGAATTTG CAAAATTTAT AAGCTGTGAA
GGTCATGGTA AAAGTGAGAA AGAATTGCAT TTTAAAGCAT GTGAAATTTT TCAAAAAAAA
GGTGCTTTGA GACTTTCTTT TTCGCCTATT GTAGCTATTA ATGAAAATGC GGCTAAGGCT
CATGCTTTGC CTAGTGAGAA AAAATTAGAA TTTGGAGATT TGTTATTGGT TGATGCGGGC
GTGGTTTATC AAAGGTATTG CTCTGATCGC ACAAGAACGG CTTGTTTTGA TGAGAGTGGC
ATAGTGTTTG ATAAAAATAA GCCAAATTTT AAAGACAAAG AAATTATACA AATTTATGAA
GTGGTTAAAC AAGCTCAGCT TCAAGCTATA GAAAAAGCAC GCGTTGGTAT GATGGCAAAT
GAGCTTGATT TTATTGCAAG AGAAGTGATT AAAAATGCAG GTTTTGAAAA AGAATTTATT
CATAGTTTAG GACATGGAGT GGGGCTTGAT ATACATGAGT TGCCAAACAT TAGTCCAAGA
AGTGATTATG AGTTAAAAGA AGGTATGGTA TTTACTATTG AACCTGGAAT TTATATCCAA
GATAAATTAG GCATTAGGAT AGAAGATATG GTCTATCTTG ATAAAGAAAA GGCGGTGGTG
TTATAA
 
Protein sequence
MNFILKNENA LFYECGYSCD NALFLKLEDE AFFITDARYS FEASEMIKNA KVVLAQDLFA 
SARELLEKMG IDRVCFDPKD FSYFEFKELS KSANIVFEER LDFSKNKRII KNSKELQLLQ
KAVNFGKECF DEFAKFISCE GHGKSEKELH FKACEIFQKK GALRLSFSPI VAINENAAKA
HALPSEKKLE FGDLLLVDAG VVYQRYCSDR TRTACFDESG IVFDKNKPNF KDKEIIQIYE
VVKQAQLQAI EKARVGMMAN ELDFIAREVI KNAGFEKEFI HSLGHGVGLD IHELPNISPR
SDYELKEGMV FTIEPGIYIQ DKLGIRIEDM VYLDKEKAVV L