Gene Acid345_4212 details


Gene Information

Locus tag: Acid345_4212
Symbol:
ID: 4072171
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 4988627
End bp: 4990300
Gene Length: 1674 bp
Protein Length: 557 aa
Translation table: 11
GC content: 59%
IMG OID: 637986243
Product: glycosyl transferase family protein
Protein accession: YP_593286
Protein GI: 94971238
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family
TIGRFAM ID:
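
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon. The function names are illustrative only and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(4988627, 4990300)
print(length, protein_length(length))  # prints "1674 557"
```

Under these assumptions the computed values agree with the listed gene length (1674 bp) and protein length (557 aa).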


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.872167
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0154714
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACGAAC GAGCACCTCA ACTGCGGCTG CTGCTTGAAG TCTTTCTGCT TGCCGCGCTC 
TGCTATTTCT TTTTCTTTTT TGGACTCTCT GCTTTCGGCT TAACCGGCGC CGACGAGCCT
CGCTATGCGC AGGTCGCACG CGAGATGCTT CAGCACCACG ACTGGGTCAC GCCGACTCTC
TACGGGAACG TCTGGCTGGA AAAGCCGATC CTCTACTACT GGGGCGCAAT CGTCAGCTAC
AGGATCTTCG GCGTGAGCGA CTGGGCCGCA CGGATTCCTG GCGGAGTCTT CGCGAGCGCG
ATGCTCCTAT TCCTCTACGC GTGGACGCGC CGCTTCCGCA ACGGCTCGCA ACTCGATGCG
ATTGTGATGA CCGCGTCGTC CGTGTTTGTA TTCGCTTTTG CCCGCGCTGC TTCGATTGAC
ATTCATCTGG TCGCTCCGCT GACGATCGGC ATGCTCGCGT GGTGGGCATT CTACGAAACG
GGCCATCGCG GATGGCTGGC GTTGTTTTAC GCCATGATCG CGATCGGAGT GCTCGCGAAA
GGACCAGTCT CAGCGGCGCT GGCGGCAATG GTCATTCTCG TCTTCGTGGC GATCCGCCGT
GATTGGTCGG CCATCGTCCG CACCCTCTGG ATTCCCGGCA TCCTGATATT CTTTGCAATC
GCGCTGCCCT GGTACGTCGC GGTGCAGCAT GCGAATCCGG GCTTCGTCCG CGAGTTTTTC
ATCACCCATA ACCTGAGTCG TTTCACGACC AACCGTTTTC AGCACCGCCA GCATTTCTGG
TACTACATCC CGGTCCTGAT CGGCGGCACC ATGCCGTGGA CGGTCTTCGT CATCGCTGCG
CTTGCAGGCG GAATCAAGTC GCTGCGCGAT AAGAACGAAG ATCCGTTGCT CACCTATCTC
GCGGTGTGGG TACTCATCCC GCTGATCTTC TTTTCGTTTT CGCAGTCGAA GCTGCCGGGC
TACATCCTGC CTTCGATCGT TCCGTGCGGT CTGCTCGTCG CCATCTGGCT GCGCCGCGAG
AACACCAAGC CGTCGCCGGT GCTTATTTCG GTGCATGCCT TGCTGTCCGG AGCTGTGCTC
GCAGTTGCTC TGCTCGCGCC CTACAAGCTC TACAAAATGC CGCTGCCCGG CCAGGTGCTG
CGCATCGCAA TTCCAGTAGG GTTGATCGTC GCGCTTGTGG TTGCGATTGT TGTTTTCCTG
CGCGGATACG CGGCACTCCG CTTCGCTACG ATATTTCCCG TCGCGCTCGC ACTGGCGTTT
CTGCTCAAAG CCGCCGGCCC TGCTATTGAT TCCACGCAAT CCATCCGGCC AGTTGCAGCG
CGCATCACGA ACTCCTTTGC CACAAATGAG CCGGTAATGT TCTACAACGT CCCGCGCGGC
GTTGAGTATG GACTGGCCTT TTACCTCGAT CGCCCCCTGC CCGAGCCGCC ACCGGACGAA
ATCGTCAGGT TCGGCACGGC TGCTGCCGGC AACCAGCAGA GAGAAAAGAC TTTGAAAGAT
ATCAGCAACT CCCTCCCGCC GAACCACGGG AATTATGTGC TGGTTACGCG CGCGGGAGCG
ATCAACCGCT TTGCCGATAC CGTGCCTCCG AACTACCAGA TCGAGCCGTT CTTCCGCTTC
CAACCGCAGC GCCTCGACGT GTATTTCTTG CGCGATATCG GCCCTGGACG CTGA
 
Protein sequence
MNERAPQLRL LLEVFLLAAL CYFFFFFGLS AFGLTGADEP RYAQVAREML QHHDWVTPTL 
YGNVWLEKPI LYYWGAIVSY RIFGVSDWAA RIPGGVFASA MLLFLYAWTR RFRNGSQLDA
IVMTASSVFV FAFARAASID IHLVAPLTIG MLAWWAFYET GHRGWLALFY AMIAIGVLAK
GPVSAALAAM VILVFVAIRR DWSAIVRTLW IPGILIFFAI ALPWYVAVQH ANPGFVREFF
ITHNLSRFTT NRFQHRQHFW YYIPVLIGGT MPWTVFVIAA LAGGIKSLRD KNEDPLLTYL
AVWVLIPLIF FSFSQSKLPG YILPSIVPCG LLVAIWLRRE NTKPSPVLIS VHALLSGAVL
AVALLAPYKL YKMPLPGQVL RIAIPVGLIV ALVVAIVVFL RGYAALRFAT IFPVALALAF
LLKAAGPAID STQSIRPVAA RITNSFATNE PVMFYNVPRG VEYGLAFYLD RPLPEPPPDE
IVRFGTAAAG NQQREKTLKD ISNSLPPNHG NYVLVTRAGA INRFADTVPP NYQIEPFFRF
QPQRLDVYFL RDIGPGR
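
The sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that layout, compute a GC fraction, and translate a coding sequence. It is an illustration under stated assumptions, not the method used to produce this record: the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and the codon table is the standard one, whose codon-to-amino-acid assignments translation table 11 shares (table 11 differs in which start codons it permits, which this sketch ignores).

```python
# Standard codon table in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq_text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq_text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First line of the gene sequence shown above.
first_line = "ATGAACGAAC GAGCACCTCA ACTGCGGCTG CTGCTTGAAG TCTTTCTGCT TGCCGCGCTC"
print(translate(clean(first_line)))  # prints "MNERAPQLRLLLEVFLLAAL"
```

The translated fragment matches the first 20 residues of the protein sequence listed above. Passing the full gene sequence through `clean` and `gc_fraction` would give a value to compare against the listed GC content.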