Gene Acid345_0917 details


Gene Information

Locus tag: Acid345_0917
Symbol:
ID: 4070569
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 1155559
End bp: 1157259
Gene Length: 1701 bp
Protein Length: 566 aa
Translation table: 11
GC content: 57%
IMG OID: 637982924
Product: TPR repeat-containing protein
Protein accession: YP_589994
Protein GI: 94967946
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2956] Predicted N-acetylglucosaminyl transferase
TIGRFAM ID:
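
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS is unspliced and ends in a single stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length(1155559, 1157259)
print(length, protein_length(length))  # prints: 1701 566
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.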


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00563549
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGTGCTTGG CAACTCCCGC CGTATTCGCC CAACAGTCGG CGGCTGATTT TTATAAGCGG 
GGAGTTCAGG CCTACGGGCG GGGAGACGAC GCATCCGCGC TCTCGTCTTT TCAACAGGCA
TCGAAACTCG ATCCCAATAA TCCCGAGTAT CAGAATGCCG TAGGCCAGGC GCTGTTCAAG
CAGGGAAGAC CAGCCGAAGC AATTCCGTAT TTCCGTCATG CCCTCAAACT CCGCCCCGAT
CTCGCAGTCA TTCATGCATA CCTGGGTCAA GCTCTTCTCG CCGATCACCA GGCTGATGCC
GCCATTTCCG AATACCGTAT CGCTGTCAAA ATGGCTCCCA ACGAAGTCGA GGCCAATCGT
GGATTGGGTC GCTCGCTCAG CACCAAAGGG GACCTCGACG GCGCCATCGC CGTCTATCGT
TCCGCACTGG AGACCAATTC GCAAAGCGCG CCACTTCATG ACGATCTCGG CTCGTTGCTG
GCCCAGAAAA AAGACTTCGT TGCCGCGCAA CAGCAATTCG AACAAGCCTT AAAACTCGAC
CGCCAGTACG AGCCCGCACA TTTTCACCTT GGCGTCGCGC TACTTTCACA AGACAAAGAT
CCTGAGGCAA TGCTTTCTTT ACAGGAAGCG GTGCGTCTCG CGCCGAACGA TGTTGCCGCC
CACTTCTTTC TCGGTCGCGT TCTCGAGACA CTCGGCGACA ACGCGAATGC TCTACAGAAC
TACAAAGACG CTGCCCAACG CTCTTCCGAA TTTCCCGGCC TCCAGGAGAG ACTTGGACTC
ACAGCGCAAC GAGTAGGCGA AATGCCGACC GCGATCTCCG CTTTCCAGAA AGCCATCGCG
CAATCCCCGC AGAACCCCGA TCTTCATAAC GACCTTGGCC TGGCATTCAT GCAGGCTGGA
GATGGCGAGG GAGCTATTCG GGAATTTAAC CAGGCCCTCA ACCTGAAGCC GGAGGATGTC
GGCTATCTCG GAAATCTCGG GGCCGCCTAC CTTCAGCTTT CCGAGTTCGA CAACGCCGTT
GATAACTTCC GCAAAGCTCT CCAGATCGCG CCGGCCAACG CATCACTGCA CCATGATCTC
GCGTTGACAT TGAAGTTGAA GGACGATCTC GCCGGAGCTG CAGCGGAGCT TCGCGAGGCC
ATCCGGCTCG ATCCTAAACT CTACGACGCA CATTACACGC TGGGAGTCAC CCTTTGGCAG
CAAGGCGAGT TTCCCGCCGC CGTTGAAGAA CTCGAAGCCG CCCTCGCCCA GAAGCCCGAC
TATGCTGAGG CTTATTACAC CCTCGGCACC GTTTACAAGC AGATGAATAA ACCGCGTGAA
TCCGCCGAAG CACTTCGCTC TGCATTGAAA ATTCAGCCCG ACTTCGCCGG CGCTCACACG
ACTCTAGCCG CAGTCCTCCG TCAATTGGGC GACACCGCTG GTGCCTCCGA AGAAGCACGT
ATCGGCGCGG AACTTGCAAA GAAGAAAACC GGCATGCAGG CCGCGGTGTT CGCAACCAAC
TCTGGAATTC GTCTCCTAAA TGCAGGCGAT CTGGATGGGG CTGTTTCCCA ATTCCGACGG
GCTACCGAGT CGGCACCCGA CTACGCCATG GGGCACTTTC AACTCGCAAC TGCACTCTCC
CGCCAAGGCA AACGCGACGA GGCCGATGCC GAGTTCTCCA AGGCTGCAAC CCTTGATCCA
CACCTGAAAA CCCAGAAGTA G
 
Protein sequence
MCLATPAVFA QQSAADFYKR GVQAYGRGDD ASALSSFQQA SKLDPNNPEY QNAVGQALFK 
QGRPAEAIPY FRHALKLRPD LAVIHAYLGQ ALLADHQADA AISEYRIAVK MAPNEVEANR
GLGRSLSTKG DLDGAIAVYR SALETNSQSA PLHDDLGSLL AQKKDFVAAQ QQFEQALKLD
RQYEPAHFHL GVALLSQDKD PEAMLSLQEA VRLAPNDVAA HFFLGRVLET LGDNANALQN
YKDAAQRSSE FPGLQERLGL TAQRVGEMPT AISAFQKAIA QSPQNPDLHN DLGLAFMQAG
DGEGAIREFN QALNLKPEDV GYLGNLGAAY LQLSEFDNAV DNFRKALQIA PANASLHHDL
ALTLKLKDDL AGAAAELREA IRLDPKLYDA HYTLGVTLWQ QGEFPAAVEE LEAALAQKPD
YAEAYYTLGT VYKQMNKPRE SAEALRSALK IQPDFAGAHT TLAAVLRQLG DTAGASEEAR
IGAELAKKKT GMQAAVFATN SGIRLLNAGD LDGAVSQFRR ATESAPDYAM GHFQLATALS
RQGKRDEADA EFSKAATLDP HLKTQK
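
The sequences above are printed in space-separated blocks of 10 characters, so they need to be joined before any analysis. As a minimal sketch, the Python below strips the whitespace and computes the GC percentage of a nucleotide string. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the example uses only the first line of the gene sequence, so its result will not necessarily match the whole-gene GC content field.

```python
def clean_sequence(block_text: str) -> str:
    """Join a block-formatted sequence by dropping spaces and line breaks."""
    return "".join(block_text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence from this record, copied verbatim.
first_line = "ATGTGCTTGG CAACTCCCGC CGTATTCGCC CAACAGTCGG CGGCTGATTT TTATAAGCGG"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the full gene sequence text through `clean_sequence` first would give a figure to compare against the GC content field.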