Gene Acid345_3655 details


Gene Information

Locus tag: Acid345_3655
Symbol:
ID: 4072258
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 4324550
End bp: 4326070
Gene Length: 1521 bp
Protein Length: 506 aa
Translation table: 11
GC content: 60%
IMG OID: 637985678
Product: TPR repeat-containing protein
Protein accession: YP_592730
Protein GI: 94970682
COG category:
COG ID:
TIGRFAM ID:
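
The length fields in this record can be cross-checked against the coordinates. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon. The function names gene_length and protein_length are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information record.
START_BP = 4324550
END_BP = 4326070


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count for an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1521 506
```

Both results agree with the Gene Length (1521 bp) and Protein Length (506 aa) fields.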


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.822576
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGCGCG CGGCTCCGCT GCTGCTCAGC TTGCTTCTGA TGTCGGGAAC GCAGTCGTTT 
TATGCTGCCC AGGAAACGCA CGACCATCCG GTGCCGGAAG TTCTCGGGAG CGTGACTTTT
CCGATCTCGT GCACAGCCGA AGTGCAAGGC GACTTCAACC GTAGCGTCGC TCTGCTGCAC
TCGTTTGCGT ATGCCGCGGC GTTGAACGCG TTCCAGGCAG TGGCTGAGCG TGATCCGAAA
TGCGCAATGG CGTATTGGGG CGTCGCCATG TCGGGCTATC ACCAGTTGTG GGAGCCTGCG
ATTTCGGCCG ATGGGGCTGC ACGGGCGCAG CGCGAGCTTT CACTGGCGAT GAGCGCAGGC
GCTGTAACAG ATCGCGAACG CGGGTTTCTG AACGCGGCAA ATGCGATCTT CAAGGATGCC
GATACGGTTC CGATTGCGAC TCGTGCCGGA GCTTACGAGA AGGCGATGGC GGAACTTGCA
GCGCGTTATC CGGCCGACGT CGAGGTGCAA ACGTTTTATG CGCTTGCTCT CCTGGCGAAT
GCATCGCCCT CCGACAAAAC GCACGCGCGC CAGAAGCACG CGGCGGACAT CCTGGAGGCG
CTCTTCAAGA AATACCCACA GCATCCGGCG ATTCCGCATT ACCTGATCCA CGCCTACGAC
AACGCGGAAC TCGCGGAGCG AGGACTTCCC GCGGCTCGCG CCTATGCGCA GGTTGCGCCA
TCTGCGCCGC ATGCTTTGCA TATGCCGAGC CACATTTTCA CGCGACTCGG GCTGTGGGAA
GACTCGATCG CGTCGAACAC GGCGGCGCGT ACTGCGGCAC ACCGAGCGGG TGATATCGGA
GAAGAACTGC ACGCAATGGA TTACCTCGTG TATGCGCAAC TGCAGCTTGG TCGCGATGAA
GATGCCGCGC AAATCGTCGG CGAGTTGAAG AAGATGGAGA GCCTGCACAC TGCCGATTTC
AAAGTCGGTT ATGCTGCGAC CGTGATGCCG ATTCGCTACG CTCTGGAGCG CGGAAAATGG
GCAGAGGCGG TTCAACTGCC GGTTCCCGAG TCGGCTCCGC CGCATGTGCG TGCAATTGCG
ATCTGGGCGC AATCGATTGG GAACGCGCAC ATGGAGAAGG CGAAGGAAGC ATCGGGTGCA
GTCGCACAGC TTCAGCAGAT CGAGGACGAC TTGCAAGGGA AGGGCAACGG GTACTGGGCA
ACGCAGGTGC GCGTCCTCAA GCGCGAGGCG ATGGCGTGGG TAGCGTTCGC CAACCATGAC
TTGGACAAAG CTACTTCAAC GATGCGTCAG GCTGCGGATG AGGAAGATGC GGTGGAGAAG
TTGCCGGTGA CTCCGGGGCC CGTGATTCCT GCACGCGAAC AACTCGGGGA ACTTCTGCTC
GAGCAAGGCA AACCGGCGTT GGCGGTAGAA GAATTCAACA TCGATCTGCG CAATTCGCCG
AATCGGCGAC GCGGAAGGTT TGGTTTGAAT GAAGCGACGA AGAAGGTAGA GTCGAACCAT
CGTGATGATC GCGCGCTATA A
 
Protein sequence
MSRAAPLLLS LLLMSGTQSF YAAQETHDHP VPEVLGSVTF PISCTAEVQG DFNRSVALLH 
SFAYAAALNA FQAVAERDPK CAMAYWGVAM SGYHQLWEPA ISADGAARAQ RELSLAMSAG
AVTDRERGFL NAANAIFKDA DTVPIATRAG AYEKAMAELA ARYPADVEVQ TFYALALLAN
ASPSDKTHAR QKHAADILEA LFKKYPQHPA IPHYLIHAYD NAELAERGLP AARAYAQVAP
SAPHALHMPS HIFTRLGLWE DSIASNTAAR TAAHRAGDIG EELHAMDYLV YAQLQLGRDE
DAAQIVGELK KMESLHTADF KVGYAATVMP IRYALERGKW AEAVQLPVPE SAPPHVRAIA
IWAQSIGNAH MEKAKEASGA VAQLQQIEDD LQGKGNGYWA TQVRVLKREA MAWVAFANHD
LDKATSTMRQ AADEEDAVEK LPVTPGPVIP AREQLGELLL EQGKPALAVE EFNIDLRNSP
NRRRGRFGLN EATKKVESNH RDDRAL
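
The gene and protein sequences can be checked against each other by translating the nucleotide sequence. The sketch below is a minimal, dependency-free illustration, not the method used to produce this record. It builds the codon table from the NCBI base ordering (TCAG); translation table 11 assigns the same amino acids as the standard code and differs only in permitted start codons, which this sketch does not model. The helper names clean, translate and gc_percent are hypothetical.

```python
from itertools import product

# Amino acids listed in NCBI order for bases T, C, A, G (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; uppercase the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Raises KeyError on any base other than A, C, G or T.
    """
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a non-empty sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence and the first 20 residues of the protein.
first_gene_line = "ATGTCGCGCG CGGCTCCGCT GCTGCTCAGC TTGCTTCTGA TGTCGGGAAC GCAGTCGTTT"
first_protein_block = "MSRAAPLLLS LLLMSGTQSF"
print(translate(first_gene_line) == clean(first_protein_block))  # True
```

Passing the full gene sequence to translate and gc_percent would allow the same comparison against the complete protein sequence and the GC content field.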