Gene Acid345_4609 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_4609 
Symbol 
ID4070766 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp5462041 
End bp5463291 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content59% 
IMG OID637986649 
Productglycosyl transferase, group 1 
Protein accessionYP_593683 
Protein GI94971635 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.955462 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.431404 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGATTT GCCTGGTCAC AGCCTTCCCG CCTAGCCGCC GAGGGCTGAA CGAATACGGG 
TACCACATCG CACGTGAGCT CCAGCGCGAT CCCGTGCTCA GCGTCACTGT GCTCGCCGAC
GAGTTGGAGA CGCCTGAGCC GGAGTTGGCA GAATTTGACG TCAACCGCGT TTGGCGCTTC
GACAGCCTCT CCAACCCCTC ACGCCTCGCC AAAGCCATCC GCTCCTGCAA GCCGGATGTG
GTTTGGTTCA ACCTGTTGTT CTCAACCTTC GGGAACAACC CGCTCGCGGC GTTCTCCGGT
CTCACCATCC CCGCCACTAC CCGCATGGGC GGCTGCTACA CCCACGTCAC CCTCCATCAC
TTGATGGAGA ACATTGATCT CTCGCATGCC AACGTTCGTT TCCCGCGCGC CTATCGCTTT
GCCGGAAATG TCGCCACCCG CATGCTGCTC GCCGCCAACT CGATTAGCGT CCTGTTGCCG
GCCTACCGTC GTACGCTCAT CAACAAATAC AAGGGCGAAA ACGTTCACTT CCGCGCCCAC
GGCATCATGT CGGCCCGGCC TGAACCGCCC GATTACTCGC GCCGTGGCGT CCCTGACCAT
CGCGTCCTCG CCTTCGGCAA GTGGGGCACA TACAAGCGCC TCGAGCTCTT GATGGACTCT
TTTGAGCTAG TCGTGAAGCG CCTGCCGAAT GCCAAACTTA TCGTCGCCGG CAGCGATCAC
CCTATGACCC CCGGCTACCT CGATAGCATT GCCGAAAAAT ATAAGGACGA CCCGCGCATC
GAATTTGTCG GATACGTCGC GGAAGAAGAC ATCCCCGAAC TCTTCCGTAG CTCCAGCGTT
CTAGTCATGC CTTACTCCTC CGCCACAGGT TCTTCCGGAG TCGCACATCT CGCAGCCGAG
TTTGGGCTTC CCATTATCTG CGCCGATATT CCCGACTTCC ACGAGATGGC CGATGACGAA
GGATTGGGCA TCCTCTTTTA TCAAACCGGT AGCGAAAGGA GCCTCGCCGA CCAGATCTGC
GGTCTGCTTA ATTCGCCCGA AATGATGAAA GAGATGTCGG AACAAAATTT TTCCGCCGCG
CTACGACAGA CCATGCCGCA GATCATCCGG CAATATCTGC GCTCGTTTGA CTTGCACCAG
CGCCAGCGCG CGTTGCAGCC CATCGCTCGC TTTCGCCGCA TCCCCGGTTG GGTGCCCTCG
CGTTCGGCCA TCTTTCGCGC TGCCGCGCCA AGGTGGGTGC CATGGATGTA A
 
Protein sequence
MKICLVTAFP PSRRGLNEYG YHIARELQRD PVLSVTVLAD ELETPEPELA EFDVNRVWRF 
DSLSNPSRLA KAIRSCKPDV VWFNLLFSTF GNNPLAAFSG LTIPATTRMG GCYTHVTLHH
LMENIDLSHA NVRFPRAYRF AGNVATRMLL AANSISVLLP AYRRTLINKY KGENVHFRAH
GIMSARPEPP DYSRRGVPDH RVLAFGKWGT YKRLELLMDS FELVVKRLPN AKLIVAGSDH
PMTPGYLDSI AEKYKDDPRI EFVGYVAEED IPELFRSSSV LVMPYSSATG SSGVAHLAAE
FGLPIICADI PDFHEMADDE GLGILFYQTG SERSLADQIC GLLNSPEMMK EMSEQNFSAA
LRQTMPQIIR QYLRSFDLHQ RQRALQPIAR FRRIPGWVPS RSAIFRAAAP RWVPWM