Gene Acid345_4525 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_4525 
Symbol 
ID4070203 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp5368376 
End bp5369629 
Gene Length1254 bp 
Protein Length417 aa 
Translation table11 
GC content59% 
IMG OID637986564 
Productpeptidase S7 
Protein accessionYP_593599 
Protein GI94971551 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.85938 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAGTTA CCGAGTGGTG CCAGCGACAA TTGAGCGGCG GGAAACGTGC CATCGTTTAT 
TGCGCTGTCG TTGCATTTCT CTCGACAGCT TCGAACGCTC AATCGCAGGC GGCCCCAAAG
TTTTATCAGC AAGGCGCGCA GTGGGTGGAA GAGTTCAGCG GATTTGCACC GATCTCGCAC
ATGCTGAAAG TGATGATGTC GGTTGGGGCC ATTCATATCG AAGGCGGCGG GCAAGACGAG
ATCAGCTATA CCGTGCGTAA GCGATGCATG CGCAGCACGC AGGAAGCGGC TCGCAAGATT
TTCGACCAGT TCCGGGTTTT CACGGCGAAG AAGACAGATG CCACGATCAT CCAGGGCGAC
TGGCTAGGCG GCAAAGACGT GAATGATCTG ATGGCCGACC TTTTCGTCCA AGTACCGCCG
AGCGTAAGCG CCGTTGCGGT AAATGCGAAG GAAGGGAATG TCACGGTCAG GGCGATCCGT
GCCAAGCTCG ACATTGATAC GTCGGCCGGG AACATTGATC TCGACCAGAT TGCAGGAGCG
GTAAAGGCAC ACACTTCCGG CGGCTTTATC ACCGCTGGTA CGCTGCTTGG CGATGCCCAG
CTGAAGAGTG GCGCGGGCAA TGTGCAGGCG CGGGCCATCA GTGGCAAGGC CACGTTGTGG
ACCGCAGGTG GTACGACATC GCTTGGGAGC GCGCGCTGGT GCTGGCTGGA GACACTGGCG
GGCAACATCA GCGTGGACCA TTGTGACGGC GAGACGCACG CGATCAGCGG TGGGGGTAGC
ATCACGCTGG GGAACATCAA CGGCGACGTC TTCGCGCAGA CTGGCGGAGG ACGCATCCAA
CTCGCGAGCG CCGCAGGACA TGTCACGGCG GCAACCGGTG GTGGCGCGGT GGAATTGCAT
CGAGTGGCGC GAGGTGTTCA GGTGGATTCC GGCGTGGGCG CGATTTCGGT AGAATTTTCG
GGATCGCCAA GGACATTCAG CGACTCGCTG ATCCGCACCT CGTCTGGCGA TGTGATTGTG
TACGTGTCGG ACAGCCTGCC GATGACCGTC CACGCCGCGA GTGATATGAC GCGCGGCCCG
GGAATCAGCA GCGAGTTTCC AGAGATAAAG ATTACGTCCG AAGGTGGAAA ATATGGACCG
AAGTCGATGT TCGCCGAGGG TACACTGAAT GGCGGCGGCC CGGTCCTGAA AGTTCGGACG
ACGATAGGGC AGATCGAGTT CCACCGGACA AACACGGCGG TGTCTGCAAA ATGA
 
Protein sequence
MEVTEWCQRQ LSGGKRAIVY CAVVAFLSTA SNAQSQAAPK FYQQGAQWVE EFSGFAPISH 
MLKVMMSVGA IHIEGGGQDE ISYTVRKRCM RSTQEAARKI FDQFRVFTAK KTDATIIQGD
WLGGKDVNDL MADLFVQVPP SVSAVAVNAK EGNVTVRAIR AKLDIDTSAG NIDLDQIAGA
VKAHTSGGFI TAGTLLGDAQ LKSGAGNVQA RAISGKATLW TAGGTTSLGS ARWCWLETLA
GNISVDHCDG ETHAISGGGS ITLGNINGDV FAQTGGGRIQ LASAAGHVTA ATGGGAVELH
RVARGVQVDS GVGAISVEFS GSPRTFSDSL IRTSSGDVIV YVSDSLPMTV HAASDMTRGP
GISSEFPEIK ITSEGGKYGP KSMFAEGTLN GGGPVLKVRT TIGQIEFHRT NTAVSAK