Gene Acid345_1462 details

Gene Information

Locus tag: Acid345_1462
Symbol:
ID: 4069612
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 1766363
End bp: 1767442
Gene Length: 1080 bp
Protein Length: 359 aa
Translation table: 11
GC content: 58%
IMG OID: 637983471
Product: rod shape-determining protein MreC
Protein accession: YP_590538
Protein GI: 94968490
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1792] Cell shape-determining protein
TIGRFAM ID: [TIGR00219] rod shape-determining protein MreC
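
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon that is not counted in the protein length.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1766363
END_BP = 1767442
GENE_LENGTH_BP = 1080
PROTEIN_LENGTH_AA = 359


def span_length(start: int, end: int) -> int:
    """Length of an inclusive, 1-based coordinate interval (assumed convention)."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))   # 1080
print(coding_codons(GENE_LENGTH_BP))   # 359
```

Under those assumptions, both results agree with the Gene Length and Protein Length fields.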


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.461363
Plasmid hitchhiking: No
Plasmid clonability: normal


Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal


Sequence

Gene sequence
ATGGAAAGCT TCCTGAGTCG CCACAAGAAC CTGATCGCCC TTGCGATTGT GCTGGTGGGG 
CAGATCATGG GATTGGCGAT GCAGGTGAAG ATCAATTCAC AGGGCGGCAT GAGCCTGATG
CGCCTCTGGT CCATCTCTGC AATTACTCCG ATTGAAAAAT TACTCGTTCA TGGTCAGCGC
GGTGTCTTCA ACGGTTGGTC GAACTACTTC TGGCTGCGCG GAGTGCGCAA AGACAACGAA
CAACTGCGCG ATCAGATCGA GAAGATGCGT CTCGACCAGG TACGACTTCA GCAGGACGCA
CTGCAAGCCC GCCGACTGCA AGCGCTCTTC GACTTCAAAG AGCAGTTCAT CTCGGAAACG
ATGCCGGCGC AGGTAATAGG GTCGAGCGGC ACCGACCAAT CACGAATTCT CTACATTGAT
AAGGGGTCAA ACGACGGACT GAGAGCCGAC TTGGCAGTGA TATCGCCGGA CGGCATCGTC
GGCAAAGTGC TGCGGGTTTT GCCGACGACT TCGCAGATCC TGATGATCAA CGATCAGCTC
AGCGGCGTTG GTGCGACGCT GGAGAAAACG CGCCTGCAAG GCATTCTCGC AGGTCAGCCC
AACGGCACGC TAATCATCAA GTACATCATG AAGGACGAGA AGGTGGAAGC CGGGGAGAGC
GTCGTAACCA GTGGCGGCGA CCGCGTCTTC CCCAAGGGCC TGCCGATCGG CAAAGTTGTG
GAGTCGAGCG GTGGTAAGGA CATGTTCCTC AACATTCGCG TGGAGCCCGC CGCTAACCTC
AGCAAGATTG AAGAGGTCCT GGTGGTGACG AAGGTTGTCG AGCGTCAGCG CACCGAAGAC
GAAACCAACG GACCGGTGAA GGCCGCCGAC ATTCTCGCCC AGAGATTGCC AACGGTTCCG
AAGCAGGCGC AGAAGCTCGA TCCGAATGGA AAGCCAATCC CCGACGATCC CAACGCGGTG
AAAGTTGCGA GCCCTGTAAC GCCGAATGGC GATGCGCCAA AGACGATGCC GGCGAACCCG
GCAGCTCCGA AAGCATCTCA ACCGAAGCCC AAGACCTCCG GCGAGGCTCC GCAACAGTGA
 
Protein sequence
MESFLSRHKN LIALAIVLVG QIMGLAMQVK INSQGGMSLM RLWSISAITP IEKLLVHGQR 
GVFNGWSNYF WLRGVRKDNE QLRDQIEKMR LDQVRLQQDA LQARRLQALF DFKEQFISET
MPAQVIGSSG TDQSRILYID KGSNDGLRAD LAVISPDGIV GKVLRVLPTT SQILMINDQL
SGVGATLEKT RLQGILAGQP NGTLIIKYIM KDEKVEAGES VVTSGGDRVF PKGLPIGKVV
ESSGGKDMFL NIRVEPAANL SKIEEVLVVT KVVERQRTED ETNGPVKAAD ILAQRLPTVP
KQAQKLDPNG KPIPDDPNAV KVASPVTPNG DAPKTMPANP AAPKASQPKP KTSGEAPQQ
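
The relationship between the gene sequence, the protein sequence, and the GC content field can be explored with a short script. This is a minimal sketch, not the database's own method: the helper names are hypothetical, and the codon table is the standard codon-to-amino-acid mapping, which translation table 11 shares for elongation codons (the two tables differ in permitted start codons). Only the first row of the gene sequence is used here to keep the example short.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(blocked: str) -> str:
    """Remove the spaces and line breaks used to print the sequence in blocks of 10."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence from this record.
first_row = "ATGGAAAGCT TCCTGAGTCG CCACAAGAAC CTGATCGCCC TTGCGATTGT GCTGGTGGGG"
dna = clean(first_row)
print(translate(dna))  # MESFLSRHKNLIALAIVLVG
```

The 20 residues printed match the first two blocks of the protein sequence. Passing the full cleaned gene sequence to `gc_percent` and `translate` would allow the GC content and Protein Length fields to be compared in the same way.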