Gene Acid345_4571 details

Gene Information

Locus tag: Acid345_4571
Symbol:
ID: 4071516
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 5417574
End bp: 5418887
Gene Length: 1314 bp
Protein Length: 437 aa
Translation table: 11
GC content: 56%
IMG OID: 637986611
Product: HipA-like
Protein accession: YP_593645
Protein GI: 94971597
COG category: [R] General function prediction only
COG ID: [COG3550] Uncharacterized protein related to capsule biosynthesis enzymes
TIGRFAM ID: [TIGR03071] HipA N-terminal domain
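
The coordinates and lengths listed above are mutually consistent, which a short check can confirm. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 5417574
end_bp = 5418887

# Assumption: coordinates are 1-based and inclusive, so the span
# covers both the start and the end base.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(f"Gene length: {gene_length} bp")
print(f"Protein length: {protein_length} aa")
```

Under these assumptions the computed values agree with the reported 1314 bp and 437 aa.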


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.31649
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGAAGA CTAGGGTCGC CGACACTCGC CCAAGCACGT TGCTTGTGAA GCTTGGCGAC 
ACCGCGGTGG GCACGATCAC ACAGCTTGGG GGCTTCGACA GAAATCTGTT TGCATTCGAC
GCGGCCTACC TTGCCGATGC GCAGCGGCCG ACGTTAAGTC TCAGTTTTCT GGATGTGGAG
GGGCAACCGA GAATCACGGA ACAACTCACG CGAAGCAAGG TCCCACCGTT TTTTTCCAAC
CTGCTACCGG AAGGGATGCT TCGTGAGTAC CTGGTCGAAC GAACGGGAAT ACCGTCGGAA
AAAGAGTTCC TACTGCTGTG GATGGTGGGG AGGGATTTAC CGGGAAATGT GATCGTCGAG
GACATGGAGG GCCGTCCATC GCCGCCTCTT TCGGAGTATC TCGGCGGCAG ACTATCGCTC
ACTGCGAACC GTCGCGCTGC CCCTCTACCC CGCTTTTCCT TAGCGGGAGT GCAAATGAAG
TTCGGCGCGG GGAAGCACCC TGGCAATCGG CTCAGTATTC CGGCGCGGGG ACTTGGCGGA
GATTGGATCG TGAAGCTACC GTCTCCACAG TACGATTCGC TGCCTGACAA CGAGTACTCG
ATGATGATGC TTGGCAAAGA CATCGGCATC GACGTGCCTG AGTTTGGGTT AGCAACTACG
AAACGTATAG AAGGAATTCC GGAGGGATTT GCGAATCTCG ATGCGAATGC CTACTACGTA
AAGCGCTTCG ATCGGACGCC CAAGTCGCGG ATTCATATCG AGGACTTCAA TCAAATCTTC
GGCCAGTTCC CCGATCAGAA ATATGGGAAG CAAAGTTACA ACGCTATCGG AAAGAACATC
TTCAGAATTC TGGGTGAAGC GGATTATCGG GAGTTTGTGC GACGGCTGGT TTTCAGCATC
CTCGTCGGCA ACATGGATAT GCACCTGAAG AATTGGTCCG TGGTGTACAA GGATGGCCGA
ACACCAAGGC TCTCTCCGGC TTACGATCTT GTCTCGACAA TTGTGTACCC TGGGATCGAC
AAGGCGTTGC CGCTCTCCTT CGCGGGCACG AAAGATGCGC AGCAGGTGGA TGAGGATTTG
CTCGTAAGCT TCGCCGCAAA AACCGAGGCC CCGCGCAACT ACGTACTCGA AACGGCGACC
GAGACGGTAC GCAGCTTCAA AGATGCGTGG TCCGCAAAAG CAAAAGACCT GCCTTTGCGA
AAAGAGTGGA GAGAGATGAT TTCGGCTCGC CTCGCGACCC TGCCAATCGC CAACCTGGGA
TCGCAGGAAA CCACCAAGAA AAGACGACGA GGAAGGCCGC GGCGCGGCGC GTGA
 
Protein sequence
MKKTRVADTR PSTLLVKLGD TAVGTITQLG GFDRNLFAFD AAYLADAQRP TLSLSFLDVE 
GQPRITEQLT RSKVPPFFSN LLPEGMLREY LVERTGIPSE KEFLLLWMVG RDLPGNVIVE
DMEGRPSPPL SEYLGGRLSL TANRRAAPLP RFSLAGVQMK FGAGKHPGNR LSIPARGLGG
DWIVKLPSPQ YDSLPDNEYS MMMLGKDIGI DVPEFGLATT KRIEGIPEGF ANLDANAYYV
KRFDRTPKSR IHIEDFNQIF GQFPDQKYGK QSYNAIGKNI FRILGEADYR EFVRRLVFSI
LVGNMDMHLK NWSVVYKDGR TPRLSPAYDL VSTIVYPGID KALPLSFAGT KDAQQVDEDL
LVSFAAKTEA PRNYVLETAT ETVRSFKDAW SAKAKDLPLR KEWREMISAR LATLPIANLG
SQETTKKRRR GRPRRGA
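
The gene and protein sequences above can be cross-checked by translating the nucleotide sequence and by computing its GC fraction. The following is a minimal sketch, not the database's own pipeline. It assumes the sequence is pasted as shown, in blocks of ten separated by whitespace, and it uses the standard codon assignments, which translation table 11 shares for elongation (table 11 differs in its permitted start codons). The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G at each position).
# The amino acid assignments are the same for translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate complete codons in frame 1 and drop any trailing stop."""
    s = clean(seq)
    protein = [CODON_TABLE[s[i:i + 3]] for i in range(0, len(s) - 2, 3)]
    return "".join(protein).rstrip("*")


# First line of the gene sequence above (60 bases, 20 codons).
first_line = "ATGAAGAAGA CTAGGGTCGC CGACACTCGC CCAAGCACGT TGCTTGTGAA GCTTGGCGAC"
print(translate(first_line))  # MKKTRVADTRPSTLLVKLGD
```

The printed peptide matches the first 20 residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_content` allows the same comparison against the complete protein and the reported GC content.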