Gene Acid345_3222 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3222 
Symbol 
ID4072557 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp3814250 
End bp3815884 
Gene Length1635 bp 
Protein Length544 aa 
Translation table11 
GC content58% 
IMG OID637985243 
ProductTPR repeat-containing protein 
Protein accessionYP_592297 
Protein GI94970249 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.123328 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCGTCGT CGCAGGCGAT CTTCCGCGCT GGATTGGTGC TGCTTCTCGC GCTCTACATC 
CGCACGCTGA GCTTTGATTT CGTATTTGAC GATCTGCTCT TCCAGCGCAT TCCTTGGATT
CATAGTTGGC ACGCGCTGAT TCATGCGTTC CGCGTGGATG CGTTCGGCGG CACGATTGAA
GGCGGATCTT CGTACTATCG GCCGTTCGTG TCGGTCTGGT GGGCTCTGGT CGAGCGACTC
ACGCCCGGGA CGGCAGCGTG GTATCACCTG GCGACGTTGC TCACGCAAGT GTTGGTGTAT
TTCACTGCAT TTCGCTTTGG GTGTGAGTTC TTCGAGGATG AACAACTTGC TGCGCTCACG
GCGATGTTGT TCGTGCTCCA TCCGTGTGAA GTGGAATCGA CGGCATGGAA CGTAAGTGGA
GCGAACAACG GGCAAGCCGC CATCTATTTC TTCGTAACGC TGATCTTTTA TTTTCGCTGG
TGGAAGACGA AGCGCTGGGG TTGGCTTGCG GGTTCGGGCG CGTTTCACTT GCTGGCGCTG
CTGACGAAGG AGTCGCTCGT AATCACGCCA GTGCTCGTGC TGTTGCATTG CGCGATGCAG
AGCGAGCGCG CGGCACGATG GCGGAGCACG CTTGTTGTTC TTGTGCCGTA TGGGCTCGCG
ACGGGGGTGT ACCTCGCGCT ACGACAAGCC GCGATCAAGC CGCTGGCTGG TCGGTCGAAT
GCGATCAGGA CCGGGGTGAA CCTCGGTGAT CTGTGGTCGG GGCCTGCGGC ATTCTGGTGG
TACCTGAGAA AACTGATCGT GCCAACGCGC ATGGCGATCC TGCACGACTG GACGCCGGTT
ACGGGGCCGT CAACGGCTAG ATTCGTTCTG CCGCTAATGA TCTTCGTTGC GTTTTGTGTG
TTGGTTGTAT GGGCGTGGAA GAGGACGGGA TCGTGGCGGG TACTGTTTCT GGCTGCGTCG
TTCCTGCTGA ATCTGGTGCC GGTGATCGTG TACGCGAATC GCGTGACGAT GCACGAACGC
TATCTGCAAT TGCCTTCGTA TTCGTTTTGC GCGCTGCTGG CGTACGCGGC GCTTTGGGCA
ATGCGTGACG GCGGCACCAA ACGAATTTTC GCGATCGTGT TTTCGGTTTC GCTGATCGCA
GCGTGGTCGG CGGTTACGTG GTACGAGACA GGCTTTTGGG ATAACAATCT GACGCTGTGG
ACGCGGGCGG TGCAGGTTGC TCCGCATAGC GTGAATGCGC GCGTGGAACT AGCACGACTG
GTCACGGAAC AAGATCCGGG AGCGGGAATT CGCGTACTTG ATGAAGGGCT GCAGGTGTTA
CCGGAGTCGC CGGGGCTTTG GCGGAGCAAG GGGTTGATGG AATTCAATGC CGGGAAGCTG
AGCGATGCCG GGAAGTCTTT CCGCAGATCG CTCGAAGTTT CAAGCAGGTT CGCGGCGAAT
CCGGCCACCG AGCCGTCCGA TGTGAAATAC GGCCGTGCTA CGGCGTTGTT CTTCATCGCC
CAGATCGAAC AGCAGAAGGG CGATTTGGAG TCGGCGGAGC AGCATTATCG AAACGCGCTC
GATATTGATC CCGAAAACGC GGAATATCAA CGCGGGTTAG CGGGATTGCT CAGCAAACAA
GGACGAGGGC AGTAG
 
Protein sequence
MPSSQAIFRA GLVLLLALYI RTLSFDFVFD DLLFQRIPWI HSWHALIHAF RVDAFGGTIE 
GGSSYYRPFV SVWWALVERL TPGTAAWYHL ATLLTQVLVY FTAFRFGCEF FEDEQLAALT
AMLFVLHPCE VESTAWNVSG ANNGQAAIYF FVTLIFYFRW WKTKRWGWLA GSGAFHLLAL
LTKESLVITP VLVLLHCAMQ SERAARWRST LVVLVPYGLA TGVYLALRQA AIKPLAGRSN
AIRTGVNLGD LWSGPAAFWW YLRKLIVPTR MAILHDWTPV TGPSTARFVL PLMIFVAFCV
LVVWAWKRTG SWRVLFLAAS FLLNLVPVIV YANRVTMHER YLQLPSYSFC ALLAYAALWA
MRDGGTKRIF AIVFSVSLIA AWSAVTWYET GFWDNNLTLW TRAVQVAPHS VNARVELARL
VTEQDPGAGI RVLDEGLQVL PESPGLWRSK GLMEFNAGKL SDAGKSFRRS LEVSSRFAAN
PATEPSDVKY GRATALFFIA QIEQQKGDLE SAEQHYRNAL DIDPENAEYQ RGLAGLLSKQ
GRGQ