Gene Acid345_0746 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_0746 
Symbol 
ID4068622 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp919323 
End bp920579 
Gene Length1257 bp 
Protein Length418 aa 
Translation table11 
GC content58% 
IMG OID637982752 
ProductNHL repeat-containing protein 
Protein accessionYP_589825 
Protein GI94967777 
COG category[S] Function unknown 
COG ID[COG3391] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTACGGCA TGAGCGACAA GTCGAAATCG AAATTAGCAA CAACCGCGAC GTGCATCCTC 
GCGCTGGTGC TCGGCTGCGC TGTGCCTGTC TTCGGTGGAA AAGATAAGAA GAAGGATGCT
GCCGCGCCTG TGCAGGAAGA GTCGGTTCTG AAGAAATTGG ATTACTCGAA GATCGTGTGG
CCGAACCCGC CTGCGATCAC GCGGCTCAAG TATGTGGATT TCTTCGCGGG CGAGAAGATC
CAGACGCAGA TCGTTCAGGA AAAGAAGAAG TCGGAGTGGA TGGCACGACT GGCGGGCGGC
GATTCCGAGG GCAACGGCAA GAACGGCCCG AAGCAGCGGT TTGCGTTGGC TACTCCGTAC
GGCATGGCAG TGGATTCGAA GGGCCTGCTG TATGTCGCGG ACGGAAAAGT TGGAGCGATC
TTCATCTTCA ACACCGAGAC CCACGATGTC GACATGATCA AGAACGGAGT GCAGGCGCAC
TTCGGGCTGA TCACGGGATT GACGATTGAC GACGGCGACC GGCTCTTTGT TTCGGACTCG
CAGCTGCATC GGGTACTGGT TTTCGGGCCG GATCGCAAAC AGGAAGCGGT AATCAGTGAG
GGGCTGGTAG ATCCGGGCGG GATGGCGGTT GATAACGAGA ACCGGTTCCT TTATGTCGCG
GATCCGGCGC TCGACCAAGT ATTGGTGTAC GACGCCGACA AGTTCAACTT GATCCGCAAG
ATGGGGACTT CGGGAAAGAA CCACGCACTG ACGGAGCCAG GACAGTTTGC GCGGCCGACG
AACGTAGCGG TGGACAGCGA CAGCAACTTG TATGTGACCG ATACCTCGAA CCGGCGAGTA
GAGATTTTCG ACGCCGACGG ACAGTTCATT ACGGCATGGG GCAAGGCGGG CGATGGTCCG
GGAACGTTCG CACGGCCGAA GGGGATCGCG ATTGATTCCG ACGGGCACGT GTGGGTAGCG
GATGCCGCAC AGGACCGCGT GCAGTGCTTC AGCAAAGATG GAAAAGTTTT GTTGTACCTG
GGAGGACACG GATTGTTGCC GGGGATGTTC GGCAATGTTG CCGGACTGAC GATCGACAAG
AAGAACCGTG TGTACACCTC AGATCAGAAT CCGGGCCGGG TGCAGATGTT TCAGTACATC
AGCAACCCGG AGGCGCGTGC CGAGTGGGAA CGCCGGCAAG CGTTGGAAAA GGGTAAGACT
GGCGCGACAG CGACGGCTTC GCAAGCGCCG GCAAACAGTA ATAACAAGCC GAAGTAG
 
Protein sequence
MYGMSDKSKS KLATTATCIL ALVLGCAVPV FGGKDKKKDA AAPVQEESVL KKLDYSKIVW 
PNPPAITRLK YVDFFAGEKI QTQIVQEKKK SEWMARLAGG DSEGNGKNGP KQRFALATPY
GMAVDSKGLL YVADGKVGAI FIFNTETHDV DMIKNGVQAH FGLITGLTID DGDRLFVSDS
QLHRVLVFGP DRKQEAVISE GLVDPGGMAV DNENRFLYVA DPALDQVLVY DADKFNLIRK
MGTSGKNHAL TEPGQFARPT NVAVDSDSNL YVTDTSNRRV EIFDADGQFI TAWGKAGDGP
GTFARPKGIA IDSDGHVWVA DAAQDRVQCF SKDGKVLLYL GGHGLLPGMF GNVAGLTIDK
KNRVYTSDQN PGRVQMFQYI SNPEARAEWE RRQALEKGKT GATATASQAP ANSNNKPK