Gene Acid345_3975 details


Gene Information

Locus tag: Acid345_3975
Symbol:
ID: 4072448
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 4700074
End bp: 4701525
Gene Length: 1452 bp
Protein Length: 483 aa
Translation table: 11
GC content: 62%
IMG OID: 637986002
Product: glycosyl transferase family protein
Protein accession: YP_593049
Protein GI: 94971001
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family
TIGRFAM ID:
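
The Gene Length and Protein Length fields above follow from the Start bp and End bp coordinates. A minimal Python sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the unspliced CDS ends in a single stop codon (the function names are illustrative and not part of the source record):

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count of an unspliced CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(4700074, 4701525)
print(length, protein_length(length))  # 1452 483
```

Both values agree with the record: 1452 bp and 483 aa.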


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTTCCG CCCCACAGCC CAGCAGCGCT GCGACCGGAC GACGCTTCAG TCCATTTGCA 
TGGGCGATCT TCGCGCTGGG GCTGGTGTTG CGCCTGCTCT ACCTCGGCCA CAAGAGCCTG
TGGCTCGACG AAGCCGCCAC CTACACCCTC TGCCGGCTTC CCTTCCGCGA ATTCGCGCAC
GCTTGGTGGA CACACGAAGC CAACATGACC GTGTACTACG GTTTCATGCG GCTGTGGGTG
CACCTCGGCA GCAGCGAGTT CATGTTGCGA CTGCCTTCGG CGCTGTGTGG CGCAGCAGCC
GTCCCGGTGC TCTACGAGAT CGCGCGGCGA CTGTTCGATC GCAACGTCGC GTGGATGGCG
GCGCTTCTGC TCGCGGTGAA TCCGTCATTT ATCGAACTGT CGCAGGACGC GCGCAGCTAT
CCCATGACTC TGTTGCTGGT ACTCGCGAGT GCTTGGTTCT ATATCGACGC ACTCGAGACC
GATAGCATCC GCTCGTGGAC AGGGTACGTT TCGTTCGCCA GCATTGCGGT CTACGCGCAC
TTTTTCGCGG CGCTCATCAT CGCCGCCTTC TTCTTTGCCT TTCTGGCGAA GGCGCGCGCC
GAGCGTCGCT GGCTTCGCGG CATGCTGGCA TATGTGGCGC TCGGACTGCT CTGCTTGCCG
GCGGCGGCGT TCGTGGTCTT CCGCAGCCAG ACGCTGGATC TTTACTGGCT CGACACGCCC
AGCGCGAAGA TGCTGTGGAA CTTTGCGAAG TTCCTGTTCG GCAGCGGCGC GAAGGTCGTG
ATTGCGCTTC TGCTCTTGGT TCTAGGTGCG TGGTTCGTCT GGAGCCACCG CGAATGGCGC
TGGCGCGCGT TCTTCCTCTC GCTCTGGTTG CTGCTCCCTG TCGCGATTAC CGCACTCGGA
TCCATGCATC ACAGCATCTT CGCGTTCAAA TACCTGCTGA TCTGCCTGCC CGCTGGATTG
CTTCTCTGTA CAGTCGGCGC GGCGCGACTG CCGTGGCGCT GGGGATGCAT CGTGGTCGCG
GTGCTGACCG CCGCATCGAT CTTTACCGAC GTGCAGTTCT ATCGCAAGCC GCGCGAGGAC
TGGCGCGCAC TCAACCAATT TGTCGTCACC CGATATCAGC CAGGCGATGC CGTCACTTTC
TATCCGTGGT ACACGCGCAC AGCCTTCGAC TATTACCACG AGCGCACCGG CCCCGCTGGT
CTCGCTTTCG GCGTGCCGAT GGTTCCCCCA GATGATCTCG CGGGAACCGC GGAAGATCGC
GCACATCCCG AGCGCGTAGT GGAAGGGCTC ACGAATCCGC GGATATGGGT CGTGATCTAT
CACCCCGACC GTCCGGTGCC GCACGAAAAC GAGCGCGTCC AAACGCTGGT CTCGGCGGTT
CCGGCCGGAT ACGCGCAGGT GGAGAAGAAA GACTTCCCGA ACCTTGAATT GCGGCTTTTC
GAAAAGCGCT AG
 
Protein sequence
MASAPQPSSA ATGRRFSPFA WAIFALGLVL RLLYLGHKSL WLDEAATYTL CRLPFREFAH 
AWWTHEANMT VYYGFMRLWV HLGSSEFMLR LPSALCGAAA VPVLYEIARR LFDRNVAWMA
ALLLAVNPSF IELSQDARSY PMTLLLVLAS AWFYIDALET DSIRSWTGYV SFASIAVYAH
FFAALIIAAF FFAFLAKARA ERRWLRGMLA YVALGLLCLP AAAFVVFRSQ TLDLYWLDTP
SAKMLWNFAK FLFGSGAKVV IALLLLVLGA WFVWSHREWR WRAFFLSLWL LLPVAITALG
SMHHSIFAFK YLLICLPAGL LLCTVGAARL PWRWGCIVVA VLTAASIFTD VQFYRKPRED
WRALNQFVVT RYQPGDAVTF YPWYTRTAFD YYHERTGPAG LAFGVPMVPP DDLAGTAEDR
AHPERVVEGL TNPRIWVVIY HPDRPVPHEN ERVQTLVSAV PAGYAQVEKK DFPNLELRLF
EKR
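
The gene and protein sequences above can be checked against each other. The following is a minimal Python sketch, assuming the 10-character blocks and line breaks are display formatting only. The helper names (`clean`, `gc_content`, `translate`) are hypothetical. Translation table 11 assigns the same amino acids to codons as the standard code and differs only in permitted start codons, so the standard codon table is used here.

```python
from itertools import product

# Standard codon table in TCAG order; table 11 uses the same
# codon-to-amino-acid assignments.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq):
    """Remove the spacing and line breaks used for display; uppercase."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Percentage of G and C bases in a non-empty sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
print(translate("ATGGCTTCCG CCCCACAGCC CAGCAGCGCT"))  # MASAPQPSSA
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `gc_content` and `translate` would allow the same comparison against the GC content and Protein Length fields.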