Gene Acid345_1369 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_1369 
Symbol 
ID4068845 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp1660523 
End bp1661551 
Gene Length1029 bp 
Protein Length342 aa 
Translation table11 
GC content60% 
IMG OID637983378 
Productputative transmembrane protein 
Protein accessionYP_590445 
Protein GI94968397 
COG category[S] Function unknown 
COG ID[COG4260] Putative virion core protein (lumpy skin disease virus) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.748341 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.82922 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACGCTAG GAAGCTTTCT CAAGAAGCAG TTCATTGATG TAATTGACTG GACCGAGCCC 
GAGGACGGGA TCCTCGCCTA TCGCTACCCG ATGCAGGACC GCGAGATCCA GAATGGCGGC
AAGCTCACGG TGCGCGACTC GCAGATGGCG CTGTTCGTAA ACGAAGGCAA GATCGCCGAC
CAGTTCAACC CCGGCCTTTA CACGCTGAAC ACTAATACCC TGCCGCTTCT GACGTACATC
ATGAACTGGG ACAAGGCGTT CCAGTCGCCG TTTAAGTCGG ACGTGTACTT CTACTCGACG
CGGCAGCAGA CTGACCAGCA CTGGGGGACG CCGAACCCGA TCACCATCCG CGATAAGGAT
TTTGGGATGA TCCGGATGCG CGGCTTTGGT ATTTACGCGT ACCACATCAG CGACCCAAAG
ACCTTCTACC AGAAGATTAG CGGCACGCGC GAGACTTACA CCGCGGCGGA GCTCGAAGGC
CAGCTGCGGA ACACGATCAT CGCGATGATG ACTGATGCGT TCGCCAACAG CCAGGTTCCC
TTCCTCGATC TTGCGGCCAA CCAGACGATG CTGGCGCAGA AGATATCGGA GAAAGTTGGG
CCGACGTTTA CCGGCTACGG CCTGACGCTG GACAGCTTTG TTGTCGAGAA CGTATCGCTG
CCCGATGAGT TGCAGAAGGT GCTCGACCAG CGGATTGAGA TGAATATGGT CGGCAATCTG
CAGCAGTACA CGCAGTTCCA GGCAGCGCAG TCGATCCCGA TTGCAGCGGC CAACGAAGGT
GGCGGTGCGG GCATTGGCGC TGGACTGGGC GCGGGCATTG CCATGGGACA GGCGATTGCG
GGAGCGGTGC AAGGCGGAAT GAATCCTCCG CCGCAGGGCG GCGCGCCGAC AGGCGGGGGA
GCACCTACGG GTGGCGCTCC GGCGGGCGGA GCAGCGACGG CGAGTGCGAC GAAGTTTTGC
ATGAGTTGCG GGAAGTCGAT TCCGCGGTCA GCGGGGTTCT GTCCGGAGTG CGGGCAAAAG
CAGGGATAG
 
Protein sequence
MTLGSFLKKQ FIDVIDWTEP EDGILAYRYP MQDREIQNGG KLTVRDSQMA LFVNEGKIAD 
QFNPGLYTLN TNTLPLLTYI MNWDKAFQSP FKSDVYFYST RQQTDQHWGT PNPITIRDKD
FGMIRMRGFG IYAYHISDPK TFYQKISGTR ETYTAAELEG QLRNTIIAMM TDAFANSQVP
FLDLAANQTM LAQKISEKVG PTFTGYGLTL DSFVVENVSL PDELQKVLDQ RIEMNMVGNL
QQYTQFQAAQ SIPIAAANEG GGAGIGAGLG AGIAMGQAIA GAVQGGMNPP PQGGAPTGGG
APTGGAPAGG AATASATKFC MSCGKSIPRS AGFCPECGQK QG