Gene Acid345_2024 details

Gene Information

Locus tag: Acid345_2024
Symbol:
ID: 4070354
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 2424376
End bp: 2425956
Gene Length: 1581 bp
Protein Length: 526 aa
Translation table: 11
GC content: 59%
IMG OID: 637984038
Product: hypothetical protein
Protein accession: YP_591099
Protein GI: 94969051
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family
TIGRFAM ID:
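
The Gene Length and Protein Length fields can be cross-checked against the Start bp and End bp coordinates. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; both assumptions are consistent with the listed values but are not stated in the record. The function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
START_BP = 2424376
END_BP = 2425956


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming its last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1581
print(protein_length(length_bp))  # 526
```

Both results agree with the record: 1581 bp corresponds to 527 codons, of which the last is the stop codon, leaving 526 residues.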


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTGCTG GGATGCCGCT TGCTGTTTTG ACGAGGCAAC ACGCCGAGGA AAGAAACCGA 
GCGCGCTGGA CGTCGGGAAT AACGCTCGTG CTCGTGCTGG CGGGGCTCCG GTTGGTGGTG
TACGCAGTCG CGGGTCCGAA CTACGGGTAT TTCCGCGATG AGCTGTATTA CCTGGCGTGC
GGGGAGCATC CGGCGTGGGG GTACGTGGAT CAGCCGCCAA TGATCGGCTG GCTGGCGTGG
CTACTCCAGC ACACGATTGG AACGTCTCTC TATGCGCTGA GGCTTTTGCC GGCGCTGGCG
CATGCGGGGA GCATCTTCTT GGCGGGAATG CTGGCACGGG AGCTGGGTGG GCGTCGCTGG
GCGATGTTTT TGGCAGCGCT GGCCACGCTG ATGGCGCCGA TTGGGCTGGC GTTCGGACAT
CTGTTCACGA TGAATGCGTT TGATCCGCTG CTATGGGTCG CGATTGCCTA CTGCGTCGTG
CGGATTGTGA ATACCGGAAA CCAGAGGCTC TGGCTGGCGG TGGGCGGGCT GACGGGAATC
ACGCTGCTCA ACAAGTATGG AATTGCGTTC TGGATTGTGG GACTGATCGT GGGCGTAGTG
TTAACGCCGT TGCGGAGCAG CCTAAAGCAG AAGTGGTTTT GGCTGGGATG CCTGCTCGGG
GCGGCAATCT GCCTGCCGAA TTTCCTATGG CAGTGGAAGC ACCACTTCCC TTTCCTTGAA
CTGATGCGTA ACGTACGCGA GAGTGGTCGC GACGTGGTGC TTGGTCCGTT GGGATTCCTG
AAGGCGCAGT TGGAGATGAT CGGCTTTGCC GCCGGCATTC TTGTGATTGC TTCCGTGATC
TACGGATTTA CCAAAGCAGG CCGCAGCTAT CGGACGCTGA ACTGTGCGTT TTTGGTTTTT
CTGCTCGCGA TCATGGGGCT GCACGGAAAG ACGTACTACG TGGCGGCGGT GTACCCGATC
GTGTTCGCGG GGGGCGCTGT GGGACTGGGT GCCGCGACGC AGAAGCGCGG TTGGGTATGG
GTAAAGCCCT TGACGGCGGT ATTGATCGCG GCGATCAGCC TGATTTACGC GCCGATGATC
GTGCCCATCC TGCCGGTGGA TAAGTTCATC GCCTACGAGG AGAAGATGGG GATCCACCAG
CAGAAGTTCG AGCACCAGCG AGAAGGCAAG CTGCCGCAGC TTTATGCCGA CATGTTCGGA
TGGGAGCAGA TGGTGCAGAA GGTGGCCGCG TACTACAACA CGCTTTCGCC AGAGGAAAAG
GCCAAGACTG CGATCTTCGC AAACAACTAT GGTGACGCGG GCGCGATTGA TTTCTTCGGG
CCGAAGTACG GGCTGCCAAA GTCTATTGGC AACCACCAGA GCTATTGGAT CTGGGGGCCG
CGGCAGTATA CAGGGGAGAG CCTGATCGTG CTGGGCGATG ATGACGAGCG CAACATGCAG
ACGAAGTGCG CTTCGTACTC GATCATTGGG ACGGCCGAGT ACCCGCTGTC CAGGCCCGAC
GAGTGGCTGA ATATCTATCA CTGCCGCGGG TTCAAATGGA ACCTGCAAGA GATTTGGCCT
AAGACGAAGC ACTTCAATTA G
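
The GC content field can in principle be recomputed from the gene sequence as it is printed here, in space-separated blocks of ten bases. The sketch below assumes GC content is the fraction of G and C bases over the total length, which is the usual definition; how the source database computed or rounded its 59% figure is not stated. The function name `gc_content` is hypothetical.

```python
def gc_content(sequence: str) -> float:
    """Fraction of G/C bases in a sequence; whitespace and line breaks are ignored."""
    cleaned = "".join(sequence.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in cleaned if base in "GC")
    return gc_count / len(cleaned)


# First ten-base block of the gene sequence above.
first_block = "ATGAGTGCTG"
print(round(gc_content(first_block) * 100))  # 50
```

Passing the full sequence text (all lines joined) to `gc_content` gives the gene-wide value to compare against the GC content field.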
 
Protein sequence
MSAGMPLAVL TRQHAEERNR ARWTSGITLV LVLAGLRLVV YAVAGPNYGY FRDELYYLAC 
GEHPAWGYVD QPPMIGWLAW LLQHTIGTSL YALRLLPALA HAGSIFLAGM LARELGGRRW
AMFLAALATL MAPIGLAFGH LFTMNAFDPL LWVAIAYCVV RIVNTGNQRL WLAVGGLTGI
TLLNKYGIAF WIVGLIVGVV LTPLRSSLKQ KWFWLGCLLG AAICLPNFLW QWKHHFPFLE
LMRNVRESGR DVVLGPLGFL KAQLEMIGFA AGILVIASVI YGFTKAGRSY RTLNCAFLVF
LLAIMGLHGK TYYVAAVYPI VFAGGAVGLG AATQKRGWVW VKPLTAVLIA AISLIYAPMI
VPILPVDKFI AYEEKMGIHQ QKFEHQREGK LPQLYADMFG WEQMVQKVAA YYNTLSPEEK
AKTAIFANNY GDAGAIDFFG PKYGLPKSIG NHQSYWIWGP RQYTGESLIV LGDDDERNMQ
TKCASYSIIG TAEYPLSRPD EWLNIYHCRG FKWNLQEIWP KTKHFN
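
The protein sequence can be checked against the gene sequence by translating it codon by codon. This is a minimal sketch, not the database's own method: translation table 11 assigns the same amino acids to codons as the standard genetic code and differs in its set of permitted start codons, so the sketch uses the standard assignments and does not special-case alternative starts (this gene begins with ATG). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon.

    Whitespace is ignored; a codon containing a non-ACGT character raises KeyError.
    """
    cleaned = "".join(sequence.split()).upper()
    residues = []
    for i in range(0, len(cleaned) - 2, 3):
        aa = CODON_TABLE[cleaned[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks and last three blocks of the gene sequence above.
print(translate("ATGAGTGCTG GGATGCCGCT TGCTGTTTTG"))  # MSAGMPLAVL
print(translate("AAGACGAAGC ACTTCAATTA G"))           # KTKHFN
```

The two fragments reproduce the first ten and last six residues of the listed protein sequence; the final TAG codon is the stop and is not emitted.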