Gene Acid345_3174 details


Gene Information

Locus tag: Acid345_3174
Symbol:
ID: 4071244
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 3764498
End bp: 3766108
Gene Length: 1611 bp
Protein Length: 536 aa
Translation table: 11
GC content: 62%
IMG OID: 637985194
Product: phage portal protein, lambda
Protein accession: YP_592249
Protein GI: 94970201
COG category: [R] General function prediction only
COG ID: [COG5511] Bacteriophage capsid protein
TIGRFAM ID: [TIGR01539] phage portal protein, lambda family
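The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 3764498
end_bp = 3766108

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)      # 1611
print(protein_length)   # 536
```

Both results match the reported Gene Length (1611 bp) and Protein Length (536 aa).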


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.474901
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGAATCG AGACGCTTTT CACGACGCCG GAAGCACCGC GGGCCTCGGC CCTGCAAACG 
CTGTTCGATA ACCGCGTGCC TGCGCCTCCT GCTCCTGCGA AGTCGGAGCC GTTGAAGGCG
AGCACCACCG GGTATCCGTC CTATGCCGGC GCCGGACTCT CGCGCCTCAA TGCCGACTGG
ATCGGCACGC TACTTTCCAG CGACCAGGAA GTCCGCAACT CCCTCAAGCG CTTGCGTGCC
CGTTGCCGCC AGCTCCACAA CAACAACGAC TACGCAATGC GCTTCGTGAA CCTCGTCAAG
CGCAACGCGG TCGGCCCCAA TGGGATCCAG CTCGAGGCGC AATTGCAGGC CGACCAGGAT
GAACTCGCCG AGCAGGTGAA TGACGAGCTC GAGCGCGGCT GGCGCAAATG GTGTCGCAAA
GGCAATCCCA CGGCCGACGG CAAGCTCTCC TGGGTCGATG TGCAGAACCT GGTGTGGGAG
TCGTTGATCG TCGATGGCGA AGTCTTCTTG CGCAAGATCG TCGGCTTCCC CGACAACGAT
TTTGGTTTCA CGCTTCAGTT CATCGACCCT GACCAGGTGG ACGTGCAGTT CAATCGTCCT
CGCAAGGTCG ATTCCGCGCG CGGCACCGTG CAGAACGAAG TCCGCATGGG CATCGAGGTA
AACGAATGGC TGCGCCCGAT CGCCTACTGG GTGCTCGATG GCCACCCGGC GGAAGGGCAC
GTCAAGCGCA CTGCGATTCC CGCTTCCGAC ATGATCCATA TCCACATGTT CCGCCGCGGC
AACCAGACGC GCGGTGTGCC CTGGCTCGTG ACCGCCATGA GCCGCATGAA CATGCTGGGC
GGATATGAAG AGGCTGAGCT CACGTCAGCG CGCGTCGGTG CCTGCCAGGG CGGTTTCTTC
GTCTCGAAAA CCGGTGAGGA ATATACCGGC CGGAAAAATA AGGACGATGG TTCCGTCGAA
GTGTCCATGG AGCCGGGTCT TTTCGAACAG TTGCCGGAAG GCGTTGATTT CAAGCCCTTC
ACTCCGCAGC ATCCCAACGC TGCTTTCCCT GAGTTCGTCA AGGCTATGGT GCGCGGTATG
GCGGTGGGCC TCGATATCAG CTATCCGTCG CTCGCGGGCG ATCTGCGCGA AGTCAATTTC
TCTTCCATCC GCCAGGCGGT CCTCGAAGAG CGCGAGATGT ATCGCACCCT GCAGACGTTT
GCAAAGGACC ACCTCAATCA GCCCGTATAC GAGGCCTGGG TGCCAGCGGC GATCCCGCGC
AAGCAACTCG CGCTGCCCGC CGCCGGCATA GATGAGTACG TGGATCCCGA GAATCTGCGC
TGGGTCGGTC GCGGCTGGAC CTGGGTGGAT CCACTGAAGG ACGTGCAGGC CGGCAAGGAA
GCGCGCGGCA GCGGCCAAAC CACGCTCGCC AAGCTCTGTG CCGCGCAGGG TGAGGATTGG
CGCGACGTCA TCGACCAGAT CGCGATCGAG GACGACTATG CGGAGAAGAA GGGCGTGATC
CTGAATTTTG CGGTCACCAA GAGCGCGGAT GGTTTGCCCG CCGTTACACC CGATCCAAGT
GCACCGCCTG TGCCGGTCAA GGATGGAGAT GAGGAGGGCG AGAACCAGTG A
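The gene sequence is printed in blocks of ten bases separated by spaces, so the whitespace has to be stripped before any computation. As a rough sketch of how a figure like the reported GC content could be recomputed, the hypothetical helper below (`gc_percent` is not part of the record or of any library) removes whitespace and counts G and C bases. The example call uses only the first line of the sequence, so its result is not expected to equal the whole-gene value.

```python
def gc_percent(raw_sequence: str) -> float:
    """Return the percentage of G and C bases in a DNA string.

    Whitespace (spaces and newlines between the ten-base blocks) is ignored.
    """
    seq = "".join(raw_sequence.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = seq.count("G") + seq.count("C")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence above, pasted as printed.
first_line = "ATGCGAATCG AGACGCTTTT CACGACGCCG GAAGCACCGC GGGCCTCGGC CCTGCAAACG"
print(round(gc_percent(first_line), 1))
```

Passing the full 1611 bp sequence, with line breaks left in, gives the value for the entire gene.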
 
Protein sequence
MRIETLFTTP EAPRASALQT LFDNRVPAPP APAKSEPLKA STTGYPSYAG AGLSRLNADW 
IGTLLSSDQE VRNSLKRLRA RCRQLHNNND YAMRFVNLVK RNAVGPNGIQ LEAQLQADQD
ELAEQVNDEL ERGWRKWCRK GNPTADGKLS WVDVQNLVWE SLIVDGEVFL RKIVGFPDND
FGFTLQFIDP DQVDVQFNRP RKVDSARGTV QNEVRMGIEV NEWLRPIAYW VLDGHPAEGH
VKRTAIPASD MIHIHMFRRG NQTRGVPWLV TAMSRMNMLG GYEEAELTSA RVGACQGGFF
VSKTGEEYTG RKNKDDGSVE VSMEPGLFEQ LPEGVDFKPF TPQHPNAAFP EFVKAMVRGM
AVGLDISYPS LAGDLREVNF SSIRQAVLEE REMYRTLQTF AKDHLNQPVY EAWVPAAIPR
KQLALPAAGI DEYVDPENLR WVGRGWTWVD PLKDVQAGKE ARGSGQTTLA KLCAAQGEDW
RDVIDQIAIE DDYAEKKGVI LNFAVTKSAD GLPAVTPDPS APPVPVKDGD EEGENQ
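The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this on the first 30 bases, assuming the standard amino-acid assignments, which translation table 11 shares with the standard code (the tables differ in which codons may act as start codons; this gene starts with ATG, so that difference does not matter here). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with the standard amino-acid
# assignments ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw_sequence: str) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon."""
    seq = "".join(raw_sequence.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three ten-base blocks of the gene sequence.
print(translate("ATGCGAATCG AGACGCTTTT CACGACGCCG"))  # MRIETLFTTP
```

The output matches the first ten residues of the protein sequence above; the final codons of the gene (GAG AAC CAG TGA) likewise translate to ENQ followed by a stop.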