Gene Acid345_1583 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_1583 
Symbol 
ID4069021 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp1931415 
End bp1932992 
Gene Length1578 bp 
Protein Length525 aa 
Translation table11 
GC content60% 
IMG OID637983592 
Producthypothetical protein 
Protein accessionYP_590659 
Protein GI94968611 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.067578 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTAGCC CTCCCTCCAA TCTCGACCAG CCATTGCAGT CTCCGCCTTC AGTCCGGGAA 
CACCTGTCGG CGTTCTCGCT GATCTTCGTC GCGCTCTTCC TCAGCCACGG CGCACTCTTA
CGGCTTCCCT ATTTCTGGGA TGAGGCTGGG TATTACATTC CTGCAGCACG CGACCTTCTA
ATTACCGGCG ATTTGATTCC GCACTCGACG CTTTCGAACG CGCATCCGCC ACTGCTGGCG
ATTTACCTCG CAACCTGGTG GAAGCTGAGC GGCTTCACGC CCGCAGTGAC GCGAATCGCC
TTACTCATCG TCACCTCGTT CTCGCTCCTT GGGCTCTGGC GACTCGCGTC TGTAGTGGCG
AACCGTACGG TTGCGGTGAT CGCCGTCCTC TGCACTGCCG CGTTTCCGGT TTTCTTCGCG
CAGAGTTCTC TCGCCCACCT CGACATGCTG GTGACCGCAT TCACCATCTG GGGAGTGGCC
TTCTACTTGG AAGGCCGCAA TGTTCGTGCG GTAATCATGT TCGCCTTCGC GGGACTTTCA
AAAGAAACGG CGATTCTTAC ACCCGGCGTT CTCATCGCAT GGGAGGTATT CGCTCCGGCA
AGATGGCGCA CCGGTCCGCG AAACTGGAAG CGCGCGGTGC GGCTGCTGCT GAGCTTCGTT
CCGCTGGCGA TTTGGTTTCT GTATCTGCAC CATCGCACCG GCTTTTACAC TGGCGATCCG
TACTACTACA GCTATAACGT CGGCGCGACG CTCACTCCAC TCCGCATCGT CCTCGCCATC
GTGCTCCGTT TGTGGCACAC GCTCGGCTAT ATGAACCTGT TCGTGCTCAC GCTGCTCACG
TTGGCTGCAA TGCTCGAACC CGCCGTGGTG GATGGCACGG CCGCACGACG CCGTATCGCA
CTCCCCGTGC AGATGGTTCT GTACGCGGTA ATCGCCGCAC ACGTGGTGTC GCTTGCGATT
GCAGGCGGCG CGGTGCTCGC ACGCTACATG TTGCCGGTCT ATCCGCTGAT TGTGCTGGTC
TGCGTGAGCA CGCTCTATCG ACGCTTGCGC TGGTGGCCGG TCGCAACCGC GGTCGTCGTT
GCGTCGTTTC TGGCAGGCCT GGTGTTCAAT CCGCCGTACC GCTTCGCGCC GGAAGACAAT
CTCACCTATC GCGACTACAT CCTCCTGCAT CGTGGCGCTG CAACGTATCT CTCGCAGCAC
GCACACGGCG CACATGTCCT GACGGCATGG CCTGCCTCTG ACGAAATCTC ACGGCCGTTT
CTTGGATACC TGAAAGAGCC CATCCCTGTC GTCCGCATCG AGAACTTCAC CGCGGCGCAG
ATGACGCTCG CCGCCGCTGC GCAAGGCCAG TACGACTGGG TCTATCTCTT CTCAACGAAA
TATGAGCCTC CACACCTGCT TATTCATTCA GCCTACTGGG AGGGCATGCA GAAGCGGTTC
TTCGATTACC ACATCGACAT CTCGCCCGAA GTGGCAGCGC GCATGGTCGG CGGACGCATT
GTGTACCAGT CCCATCGCCG CGGGGAATGG GTGGCTCTGG TGCAGATCGA GCACGCGGAG
AACGCGCGGC TTCGCTAG
 
Protein sequence
MSSPPSNLDQ PLQSPPSVRE HLSAFSLIFV ALFLSHGALL RLPYFWDEAG YYIPAARDLL 
ITGDLIPHST LSNAHPPLLA IYLATWWKLS GFTPAVTRIA LLIVTSFSLL GLWRLASVVA
NRTVAVIAVL CTAAFPVFFA QSSLAHLDML VTAFTIWGVA FYLEGRNVRA VIMFAFAGLS
KETAILTPGV LIAWEVFAPA RWRTGPRNWK RAVRLLLSFV PLAIWFLYLH HRTGFYTGDP
YYYSYNVGAT LTPLRIVLAI VLRLWHTLGY MNLFVLTLLT LAAMLEPAVV DGTAARRRIA
LPVQMVLYAV IAAHVVSLAI AGGAVLARYM LPVYPLIVLV CVSTLYRRLR WWPVATAVVV
ASFLAGLVFN PPYRFAPEDN LTYRDYILLH RGAATYLSQH AHGAHVLTAW PASDEISRPF
LGYLKEPIPV VRIENFTAAQ MTLAAAAQGQ YDWVYLFSTK YEPPHLLIHS AYWEGMQKRF
FDYHIDISPE VAARMVGGRI VYQSHRRGEW VALVQIEHAE NARLR