Gene Acid345_0768 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_0768 
Symbol 
ID4069513 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp947425 
End bp949083 
Gene Length1659 bp 
Protein Length552 aa 
Translation table11 
GC content62% 
IMG OID637982774 
ProductTPR repeat-containing protein 
Protein accessionYP_589847 
Protein GI94967799 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.400083 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGATAGTCC CGAGATTCCT GCGAATTTGT TGCGCGGCCC TGCTGCTAGT TGTGTGCGCA 
AGTGTTGCGA GCGCGCAGCA CGACGAGCAT GCCGCGAGCG CCAAGCCAAC CACCCTGATG
GAAGGCTTCG GCAACATCCA TCACAAAATT GCGACCGGCA ATCCTGAAGC GCAGAAGTAT
TTCGACCAGG GACTGGCGCT TGCGTTCGGC TTCAACCATG AAGAGGCAGA GCGCGCATTC
CGCAGAGCGG CTGAGTTAGA TCCGAAGGCC GCGATGCCCT GGTGGGGCGT GGCCTACGTC
GTCGGTCCGA ACTACAACAT GGACGTGGAT CCCGAGCACG AGCGCGTGGC GTTCGAGGCG
ATCCAGAAGG CCGTAGCGCT TTCCGCGAAT TCGCCGCAGG TCGAGAAGGA CTACGTCAAT
GCGTTGGCGA AGCGCTATTC CGACGCGAAG AATCCCGACT ATCAGAAACT CGCGAGCGAC
TATCACGACG CCATGGGAGC GCTCGCCAAG AAGTATCCCG ACGACCTCGA CGCGGCGACG
ATCTACGCCG AGAGCGGGAT GAACCTGCAC CCCTGGAAGC TCTGGAAGAA GGATGGCACG
CCGCAGCCGG GGACGGAGGA GATCGTCGCG ACGCTGGAAT CGGTGATCGC GCGCGATCCG
AATAACATTG GGGCCTGCCA TTTCTATATT CATGCGGTGG AGGCATCGGC GCATCCCGAG
CGCGGCGAGA CCTGCGCGAA GAAGCTGGCG GGGTTGGCGC CGGCGTCGGG TCACTTGGTG
CACATGCCCG CGCACATCTA CATTCGCGTC GGCGAGCATG AGGCCTCCGA AGAAACCAAC
GTAGCCGCAG CGAGGGCCGA CGAGGCCTAC ATCCAGCGAA CCGGCGCGCA GGGCGTCTAT
CCGGCGATGT ACTACACCCA CAACCTGCAC TTCATCGCGA TTGAGAACGC GACCCTGGGC
CGCTACTCTG CGGCGATGGA GGCGGCGCGC AAAGTCAGCG CGAACGTCGA GCCGATAGTG
AAAGACATGC CGATGGCGGA CTTCTTTGCG CCCTTGCCGA CGATGGTGAT GGTGCGCTTC
CGCCGCTGGG ACGACGTCCT ACAGGTGAAA CAGCCAAATT CGTATCAGCC CGACAGCACG
GGCGATTATC ACTTCGCGCG CGGTCTGGCG CTGGCGCACA AGGGCAAGAT CGCCGAATCG
AAAGTTGAGT TAGCGGCCCT GAACAAAGTC GCTGCAGAGA TGGCGAAAAT CCCGACAACT
CCTGCCGGCC CCGAAAACGC CGCCAAGATC CCGCAGATCA TGGCGCACGT GGTGGAGGCG
GAGATTGCGC TCGCACAACA AAAGAGTGAT GCGGCGATCG AGCATCTGAA GGCCGCCGTA
CAGCTCGAGG ATTCGATGGA TTACAACGAG CCGCCGGACT GGTTTCTGCC CGTCCGCGAA
ACGCTCGGCG GGACGCTCTT ACGCAGCGGC CAGCCGGTGG CCGCTGAAAT GGTGTTTCGG
AAGGACCTGG AAATAAACCG GCGTAATCCG CGCTCCATGT ACGGCCTGAC CGAGGCACTG
AAGGCGCAGA ACCGGATGCA GGACGCGCTG GCGCTGCAGG CGCAATTTGA CGAGGCCTGG
AAGGGCGCCG ATACCAAGTT GACGATTGAG GAGCTATGA
 
Protein sequence
MIVPRFLRIC CAALLLVVCA SVASAQHDEH AASAKPTTLM EGFGNIHHKI ATGNPEAQKY 
FDQGLALAFG FNHEEAERAF RRAAELDPKA AMPWWGVAYV VGPNYNMDVD PEHERVAFEA
IQKAVALSAN SPQVEKDYVN ALAKRYSDAK NPDYQKLASD YHDAMGALAK KYPDDLDAAT
IYAESGMNLH PWKLWKKDGT PQPGTEEIVA TLESVIARDP NNIGACHFYI HAVEASAHPE
RGETCAKKLA GLAPASGHLV HMPAHIYIRV GEHEASEETN VAAARADEAY IQRTGAQGVY
PAMYYTHNLH FIAIENATLG RYSAAMEAAR KVSANVEPIV KDMPMADFFA PLPTMVMVRF
RRWDDVLQVK QPNSYQPDST GDYHFARGLA LAHKGKIAES KVELAALNKV AAEMAKIPTT
PAGPENAAKI PQIMAHVVEA EIALAQQKSD AAIEHLKAAV QLEDSMDYNE PPDWFLPVRE
TLGGTLLRSG QPVAAEMVFR KDLEINRRNP RSMYGLTEAL KAQNRMQDAL ALQAQFDEAW
KGADTKLTIE EL