Gene Acid345_3071 details

Gene Information

Locus tag: Acid345_3071
Symbol:
ID: 4072635
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 3649614
End bp: 3651305
Gene Length: 1692 bp
Protein Length: 563 aa
Translation table: 11
GC content: 61%
IMG OID: 637985090
Product: hypothetical protein
Protein accession: YP_592146
Protein GI: 94970098
COG category:
COG ID:
TIGRFAM ID:
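
The start, end, gene length, and protein length listed above are mutually consistent, and a short check makes the arithmetic explicit. This is a minimal sketch, assuming 1-based inclusive coordinates (the convention that reproduces the listed 1692 bp) and a CDS whose final codon is a stop codon. The function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(length_bp: int) -> int:
    """Residues encoded by a CDS whose last codon is a stop codon (assumed)."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(3649614, 3651305)
print(length_bp)                  # 1692
print(protein_length(length_bp))  # 563
```

Both values match the record: 1692 bp is 564 codons, and removing the stop codon leaves 563 residues.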


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCAAATCG CGCCAATCTC GCGACTCTCC CTCCTCGCCG ACTGCGGGCT GCTCCTCCTC 
GCACTGCATC TCCCCATTCA CGCCCAGGAA CAAGCCAATA CGCCGAGAGA TCTCGTTTAC
AGCGGCCCAT GCAGACTCGG CGAAGCGCTG CCCGACACGC CGCGTTTTGA AGTGACCGGC
GTCGTCCTGA ATAACGTCAC CGGATCGCCG ATCGAAGGAG CTACCGTCAA GCTCAGTAGC
GAATGCATAT CCGACGAGCG CGCCCGCGCC ACCACAACCG ACGATGAAGG CAGGTTCACG
TTCACCCACG TCCGCGGAAT GGATGCCTAC ATCACGGCCA CCTTCGGCGA GGCTTTCCCG
GAGTGGAATG TTGGCCGCCG CGCCGACGAC CCGCTGAATC GCTACACGAT TGGCCCGCAC
ACCGGAGTGA TTACCCTGCG CCTTGCGCCA CCCGCTTACA TCACCGGTGT CGTGCGCGGC
GCAGACGGCG CTCCGCTCTC TCCGGCGATG GTAACTCTGC GTTGTCTCCG CCCGTGGGGC
GGCTGGCCGC AACACGAGGG CTGCAGTTCG TCCAACGTGA AACCCGACGG GTCGTATCGT
CTCGGCCCGC TGCTGCCCGG TCGTTATGCG GTCGTTATCG AGCCCGAGGT GGAATTCGGA
AAAGCTCCTG CCCCTGATGC CGATGGCGTA ACCCGCAGCT ATGTGCCGGT GCGCCAGCCT
GCGCTTACCG ACGACGAGAG TTGTCCCTAT TTCGATCTGA AGGAAGGCGA GCAGAAGCGG
CTCGACTTTA AGCTCAAGCG CGAAGTGTTG CACCACATCA CGGGCGCGAT CACCGGCAAG
TCTTGGACGA CCGTGAACGT CGTCGATCGC CTCGGCTCGC AATCTTATCC GGTGAAACTT
CTGGCGCAGT GTTGCGAGTT TGAGGCATGG GCGCCAAACG GAAGCTTCCG GATAGTCGGC
GACGGCAACC TCAAAGGCGA AGTCTCGATC AAGGTCCAGG ACGGTGATCT TTCCGGCGTG
GCGCTCCCCG CACACTCCGA CGACCGCCTC ACCATCCCCA TCGAAGTCTC GAGTACTGCT
CCGCCCAACG AGAACAGCGT TTGCCTGTTC GGCGAGACAG CTTGTGGCTT CTGGTACGCG
AACTTCCTCC GCTTCAACCC GCAAGGTGAG TTTGACGTAG TGCTTCAATC CAGCATGAAC
GGCAGCACGT CCGACGGAGT ACGACATGAG TCGGTCGAGG TTCCCTCCGG CAATTACGAG
CTGATCGTTT CGACCACAGG CAACGTCTAC GCCCAAACCA TCTCATCGGG CGCGACAAAT
TTGTTGCGCG AGCGTCTCGC CGTGAATCCG GGCGACATAC CCTCGCCCAT TCGCATCGTG
CTCGCCGAAG GCGAAATCGT CACCGGCACA ACTCTGCGCA ACGGCAAGCC GGCACGCGCG
TTCGTGTATG CCGTTCCCAG CGAGAACGAT GCTCGCGCTT TTCAGGGCGT CCCCAGCGAC
GAGCATGGGC AGTACAAGCT TGAGGGGCTC GCGCCTATCC AATACCACTT CTTCGCTTCC
GACGTGGAAC TGAACCTCGA CCTTCACGAT CCCGACGCAA TGCGCCCCTG GCTGCAATCC
TCCGAGACGC GCAGCCTTGC ATCCGGGAGT ACCACGTCGC TAGATCTACA CGTGTTGACT
CCTGCGAAGT AA
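
The gene sequence is printed in groups of ten bases, so the spaces and line breaks have to be removed before the sequence can be used. The sketch below shows one way to do that and to compute a GC fraction. The helper names are hypothetical, and the demonstration uses only the last line of the listing. Whether the page's 61% figure is computed exactly this way is an assumption.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to group the sequence into 10-mers."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an ungrouped sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# Last line of the gene sequence as listed above.
tail = clean_sequence("CCTGCGAAGT AA")
print(len(tail))             # 12
print(gc_fraction(tail))     # 0.5
print(tail.endswith("TAA"))  # True: the listing ends in a TAA stop codon
```

Passing the full listing through `clean_sequence` should yield a string of 1692 bases if the listing is complete.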
 
Protein sequence
MQIAPISRLS LLADCGLLLL ALHLPIHAQE QANTPRDLVY SGPCRLGEAL PDTPRFEVTG 
VVLNNVTGSP IEGATVKLSS ECISDERARA TTTDDEGRFT FTHVRGMDAY ITATFGEAFP
EWNVGRRADD PLNRYTIGPH TGVITLRLAP PAYITGVVRG ADGAPLSPAM VTLRCLRPWG
GWPQHEGCSS SNVKPDGSYR LGPLLPGRYA VVIEPEVEFG KAPAPDADGV TRSYVPVRQP
ALTDDESCPY FDLKEGEQKR LDFKLKREVL HHITGAITGK SWTTVNVVDR LGSQSYPVKL
LAQCCEFEAW APNGSFRIVG DGNLKGEVSI KVQDGDLSGV ALPAHSDDRL TIPIEVSSTA
PPNENSVCLF GETACGFWYA NFLRFNPQGE FDVVLQSSMN GSTSDGVRHE SVEVPSGNYE
LIVSTTGNVY AQTISSGATN LLRERLAVNP GDIPSPIRIV LAEGEIVTGT TLRNGKPARA
FVYAVPSEND ARAFQGVPSD EHGQYKLEGL APIQYHFFAS DVELNLDLHD PDAMRPWLQS
SETRSLASGS TTSLDLHVLT PAK
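
The gene begins with GTG, not ATG, yet the protein begins with M. Under translation table 11, GTG can act as an initiation codon, and an initiation codon is translated as methionine. The sketch below illustrates this on the first line of the gene sequence. It is not the database's own pipeline: `translate_cds` and `START_CODONS` are hypothetical names, and the start codon set is an assumed common subset (table 11 lists further alternatives).

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments, shared by table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Assumption: only the most common bacterial start codons are handled here.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading an initial start codon as M and stopping at a stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base grouping
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            residue = "M"
        else:
            residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


first_line = "GTGCAAATCG CGCCAATCTC GCGACTCTCC CTCCTCGCCG ACTGCGGGCT GCTCCTCCTC"
print(translate_cds(first_line))  # MQIAPISRLSLLADCGLLLL
```

The output matches the first 20 residues of the protein sequence listed above.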