Gene Acid345_0742 details


Gene Information

Locus tag: Acid345_0742
Symbol:
ID: 4069084
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 914127
End bp: 915194
Gene Length: 1068 bp
Protein Length: 355 aa
Translation table: 11
GC content: 58%
IMG OID: 637982748
Product: NHL repeat-containing protein
Protein accession: YP_589821
Protein GI: 94967773
COG category: [S] Function unknown
COG ID: [COG3391] Uncharacterized conserved protein
TIGRFAM ID:
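
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic. The following is a minimal Python sketch of that relationship, assuming 1-based inclusive coordinates and a CDS that ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(914127, 915194)
print(length, protein_length_aa(length))
```

With the coordinates listed for Acid345_0742 this gives 1068 bp and 355 aa, which agrees with the values in the record.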


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.150031
Plasmid hitchhiking: No
Plasmid clonability: normal


Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal


Sequence

Gene sequence
TTGGACGTAC GAGGAGCAAG GGCCGCAGTA CTTACCTTGT CGATCCTCGT TCCTTTGACG 
TGCCTTGCCG CAACGAAAGA GAAGCCCGCA GAAGTTTCGG TTCCGGCGAT CGAGATTGAG
GGTGGACGAC GCCTCACCTT CGAGCGGATG TTTACGACCG ATCGTGACGT CCTCGGCAAG
AAAGGCTTCT GGACGAAGGT GGTGGACTTC GTCGCCGGTG AACCAGACGA ACATTTCCTA
GTCAGACCCT ACAGTATCGC GGTGGATTCG CGCGGGCGAG CGATTGTCAC CGATCCGGGC
GCGAATGGCG TGCACATCTT CGACCTCGCC CAGCATAAGT ACAAGTTCGT CGAACGCAAT
GAGAAGGGCA AAGAGTCGAT GCTCCAGCCG CAATGCGTGG CGGTGGATGC GCACGACAAC
TTCTACGTCA CGGACTCTGA GACCGGCAAG GTCTTCGTCT TTAATGCTGA CGGCAAGTAT
CAGCGCTCGA TTGGCGCCTT GAAGGGTGGC GAAGGATTCT TCAAGCGGCC TACCGGGATT
GCGATTGATT CGGCGGCACA GCGCGTGTAC ATCACCGACA CCCTACGCGA CAAGATTTAT
GTCACCGACA TGCAGGGCCA AGTACTTGCC ACGATCGGCA AGCCGGGATC GGAACCTGGC
GAATTGCACT ATCCGACCGA ACTGCGCATT GTGGGCGACG AGCTGGTGGT GGTGGATGCG
ATGAACTTCC GCATCCAGAT CTTCGGAAAA GATGGCAGCT ATCGCGGCAG CATTGGCGAG
ATCGGCGATA CGCCGGGCGC GATGTTTCGT CCCAAGGGCG TGAGCGTGGA TTCCGAGAAC
CACATCTACG TGGTGGAAGG TGCGAGTGCG CGGGTACAGA TTTACGACCG CGAAGGCCAC
TGGCTGTACT GGTTTGGCGG AAAAGGCACG GGGCCTGAGG AGTTTCAGCT TCCTTCCGGC
ATTTTTATTG ACCACGAGGA CCGCATCTTC GTGGTTGACT CGTTTAATCG CCGGATCCAA
GTGCTGCATT ATTACGGCGT CGGTAAGCGT GCAGGAGGCC AGCCATGA
 
Protein sequence
MDVRGARAAV LTLSILVPLT CLAATKEKPA EVSVPAIEIE GGRRLTFERM FTTDRDVLGK 
KGFWTKVVDF VAGEPDEHFL VRPYSIAVDS RGRAIVTDPG ANGVHIFDLA QHKYKFVERN
EKGKESMLQP QCVAVDAHDN FYVTDSETGK VFVFNADGKY QRSIGALKGG EGFFKRPTGI
AIDSAAQRVY ITDTLRDKIY VTDMQGQVLA TIGKPGSEPG ELHYPTELRI VGDELVVVDA
MNFRIQIFGK DGSYRGSIGE IGDTPGAMFR PKGVSVDSEN HIYVVEGASA RVQIYDREGH
WLYWFGGKGT GPEEFQLPSG IFIDHEDRIF VVDSFNRRIQ VLHYYGVGKR AGGQP
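
The gene sequence begins with TTG while the protein sequence begins with M. This is consistent with translation table 11, where TTG is one of the permitted alternative start codons and an initiator codon is read as methionine. The sketch below illustrates this in plain Python under stated assumptions: the codon table is the standard one (which table 11 shares for elongation), the start codon set is the one NCBI lists for table 11, and `translate_cds` is a hypothetical helper, not a function from any library.

```python
BASES = "TCAG"
# Amino acids for codons in TCAG order (first, second, third base).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Start codons permitted by translation table 11.
START_CODONS_TABLE_11 = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, ignoring whitespace; stops at the first stop codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(seq), 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS_TABLE_11:
            # An initiator codon is translated as methionine.
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


print(translate_cds("TTGGACGTAC GAGGAGCAAG GGCCGCAGTA"))
```

Applied to the first three blocks of the gene sequence, the helper returns MDVRGARAAV, matching the start of the listed protein sequence. Passing the full gene sequence should reproduce the whole protein if the record is internally consistent.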