Gene Acid345_4031 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_4031 
Symbol 
ID4071170 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp4765275 
End bp4766438 
Gene Length1164 bp 
Protein Length387 aa 
Translation table11 
GC content57% 
IMG OID637986061 
Productglycosyl transferase, group 1 
Protein accessionYP_593105 
Protein GI94971057 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.317277 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAAACA AGAAGCCCAT GAAAATCGGC ATCACGTGTT ATCCCACCTA CGGCGGCAGC 
GGCGTGGTGG CCACTGAACT CGGCATCGAA CTCGCGCAGC GCGGGCATCA GGTGCATTTC
ATTTCCTATT CGCAGCCTAT CCGCCTGACT GAACCGCACC CCAACATCCA TTTTCACGAA
GTCGAAGTCT CGCGCTATCC ACTCTTTGAG TACCCTCCGT ACGACCTCGC CCTCGCCACG
CGCATGGCCG AGGTCGCCGA GATCTACAAC CTCGATCTGC TGCATGTTCA CTACGCCATT
CCGCACTCAG TCAGCGCACT GCTCGCCCGC GAGATGACCG CATTCGGACC CGGCCGCAAA
CGCCATCTGC CATTCGTCAC CACCCTTCAT GGCACGGACA TCACGCTCGT CGGCCTCGAT
CCTTCGTATC TGCCGATCAC GCGTTTTTCC ATCGAGAAGA GCGATGGCGT CACCTCGATC
TCGAACTACC TGCGCGAGAA GACGCTCCAG GCATTCGGCA TCAAAAACGA AATTCGCGTC
ATTCCCAACT TCGTGAACTG CGATATCTAT CATCGCGACG GCAAAACGCA ACACTATCGC
AAAGAGTGGG CCCCGAACGG CGAACGCGTG GTCGTGCACC TCTCGAACTT CCGTCCGGTA
AAGCGCGTCC CTGATGTCAT TGAGATCTTC GAGCGTATCC AACAGAGAGT TCCTGCGAAG
CTCGTCATGA TCGGCGACGG TCCAGATCGT TCGCGCGCCG AATGGATGGT CGTCGAAAAG
AAGCTGCAGG ACCGCGTTCT CTTCCTCGGC AAACAAGACG ACGTCCACGA GAAACTGCCC
GCGGCCGATC TCATGCTAAT GCCTAGCACG CTCGAGTCTT TCGGACTCGC CGCGCTCGAA
GCCATGGCTT GCGAGGTGGT TCCTGTCGCG ACGAAAGCTG GAGGCGTTCC CGAAGTCATT
GACCACGGCG TGGACGGCTA CCTCGCCGAT GTCGGCGACA TTGACACCAT GGCCATGTAC
TCCATCGACA TCCTGAGCGA CGACGAAAAA CTCCACGAGA TGGCGAAAAT GGCGCGTTTC
AAAGCACAAT CCACCTATTG CGCTTCGAAG ATTATTCCGA TGTACGAAGA TTTTTATCGT
GAGGTGCTGG AGCGTGCTTC GTAG
 
Protein sequence
MTNKKPMKIG ITCYPTYGGS GVVATELGIE LAQRGHQVHF ISYSQPIRLT EPHPNIHFHE 
VEVSRYPLFE YPPYDLALAT RMAEVAEIYN LDLLHVHYAI PHSVSALLAR EMTAFGPGRK
RHLPFVTTLH GTDITLVGLD PSYLPITRFS IEKSDGVTSI SNYLREKTLQ AFGIKNEIRV
IPNFVNCDIY HRDGKTQHYR KEWAPNGERV VVHLSNFRPV KRVPDVIEIF ERIQQRVPAK
LVMIGDGPDR SRAEWMVVEK KLQDRVLFLG KQDDVHEKLP AADLMLMPST LESFGLAALE
AMACEVVPVA TKAGGVPEVI DHGVDGYLAD VGDIDTMAMY SIDILSDDEK LHEMAKMARF
KAQSTYCASK IIPMYEDFYR EVLERAS