Gene Acid345_3940 details


Gene Information

Locus tag: Acid345_3940
Symbol:
ID: 4071323
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 4660708
End bp: 4662174
Gene Length: 1467 bp
Protein Length: 488 aa
Translation table: 11
GC content: 58%
IMG OID: 637985966
Product: hypothetical protein
Protein accession: YP_593014
Protein GI: 94970966
COG category:
COG ID:
TIGRFAM ID:
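
The start, end, gene length, and protein length values in this record are related by simple arithmetic. The sketch below checks that relationship, assuming the coordinates are 1-based and inclusive (an assumption, though it is consistent with the listed gene length) and that the final codon is a stop codon. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 4660708
end_bp = 4662174
protein_length_aa = 488

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
gene_length_bp = end_bp - start_bp + 1

# A CDS is read in codons of 3 bases. The last codon is assumed to be
# the stop codon, which does not contribute an amino acid.
codons = gene_length_bp // 3
expected_protein_length = codons - 1

print(gene_length_bp)           # 1467
print(expected_protein_length)  # 488
```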


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.634833
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGAACT ACGGCATCCC GTTTTTCAGC GTTGTGATCT TCGCGGTTTC ACTGCTGGTG 
GGATGTGGAT CGTCTCCGCT GGCGGTGCAA CCTGCGCCGC CGCTGAGCGC GGAAAATCTC
AACCTGATTT TTGTCTCTAG CGAAGATTTG GCGCACCACG CTTCCGGCGA CGTCAGCGAG
GCTACGGCAA ACCTTACGAA CCAAGGGCTG CAGAGAGTTC TGCTCAATGC AGCTTTCCTG
CGCAAGAACG TGCTCGCTAA CCAGAATGTG AATGGAATCT ATGCGCTGGA GCCAATGACG
CACTTGCAGA CGGCAAGCCA GTATCCCGAC ATGGCTGCGC TGGAGATGGC GCAACAATTC
GCAGTGTTGA ACCAAGTCAC GTTGTCCAGT GACCAGTCCG GAGGCACGCC ATTCACGGGC
CAAAACTTTC CCATCAATGC ATCGTATTCT CCGAACGCAG TGCCGCCGGA TGTGCTGGCT
CCGCTGCAAT TCTGCCCGGC TTGCCAAGGG TTGGATTTTT CCGATGTGGG TGGTGACAAC
GAAGCGGTCG TAAGCAAGGT CTTAACCGCG AAGACCCCGG GTTTCTACGT GTTCGTCGCG
CCGTGGGAGA CGGTCCGCGA GTTAATGGTG AATGCCGATC GGACGGAAGG ATATGCCTTG
CAACTTCCGG AGGAATATCC CGGGCCGAAC ACGATCTATG CCATAGCGGT TGCGCCTTCC
GGCAGCGCAA GCCTCGTTGA CTATGACACC AAGGCAAATC CGGGCGCGTC TTATCCAACG
CTGCCCGCGC CAGTTCCAAC TACCACCTGC ACGGTTCGGA CTCCCGAGAG TGTGACGGTA
ACGGGTGGCG TAGATGGCGC GGTGGTTCCG GCGAATGCCA ATACCGACGA AGTGCTGTAC
ATGATCCGGC ATGCGGAGGC GCACCCGCAG GGATATTGGT CGGACAACAA CTATGTCGCA
GCCGGCAACT GGCGCGCATT GGCCCTTCCT TCCGCGTTGG AAGGCAAGAG CAATCCCGAC
GAGGTGTGGT CAGGAGACCC GTCGTCGTTC GGAATGGGAA CGATGAGCAA TACGGGCCAA
AACTATTTTT CAGGCGTAGC GCCACCCTTG ACCGTAGTGC CTTACGTGAT CGCTAAGGAC
CTTCCCTATC ACCTGGTGGC TGGCTTCGAC ATGACGGCGG CGGCCTCGGC GTCGCAGAGC
AGCCAGTTCT TCTTCACCGG AGGCAGGTTC TCGCAGCGCA AAGTACTTCT CGGGTGGATG
TACGTTCAGA ACCAGCAAAT CATCAACGCT TTGTTCGCAA GCTATTATCC GAACGGGGGA
GCGCCGGTGG TGCCGACGTG GTCTCCGTTG GATTACGACA GCCTCTGGAC GGTGACTTTC
GATGGGCAGG GCAACTTCAC TGTCGATTAC TCCCGATGCG AAGGAATAGA TTCGGCGGCG
CTGCCAGCGA CTGCGCCTCA GTTCTGA
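
The gene sequence above is laid out in space-separated blocks of 10 bases, so it has to be cleaned before any computation such as GC content. The following is a minimal sketch of that cleanup; `clean_sequence` and `gc_percent` are hypothetical helper names, not part of any tool associated with this record, and only the first line of the sequence is used as a sample.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above, used only as a sample.
# Paste the full sequence here to compare against the listed GC content.
first_line = "ATGAAGAACT ACGGCATCCC GTTTTTCAGC GTTGTGATCT TCGCGGTTTC ACTGCTGGTG"
sample = clean_sequence(first_line)
print(len(sample), round(gc_percent(sample), 1))
```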
 
Protein sequence
MKNYGIPFFS VVIFAVSLLV GCGSSPLAVQ PAPPLSAENL NLIFVSSEDL AHHASGDVSE 
ATANLTNQGL QRVLLNAAFL RKNVLANQNV NGIYALEPMT HLQTASQYPD MAALEMAQQF
AVLNQVTLSS DQSGGTPFTG QNFPINASYS PNAVPPDVLA PLQFCPACQG LDFSDVGGDN
EAVVSKVLTA KTPGFYVFVA PWETVRELMV NADRTEGYAL QLPEEYPGPN TIYAIAVAPS
GSASLVDYDT KANPGASYPT LPAPVPTTTC TVRTPESVTV TGGVDGAVVP ANANTDEVLY
MIRHAEAHPQ GYWSDNNYVA AGNWRALALP SALEGKSNPD EVWSGDPSSF GMGTMSNTGQ
NYFSGVAPPL TVVPYVIAKD LPYHLVAGFD MTAAASASQS SQFFFTGGRF SQRKVLLGWM
YVQNQQIINA LFASYYPNGG APVVPTWSPL DYDSLWTVTF DGQGNFTVDY SRCEGIDSAA
LPATAPQF
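
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below builds the codon table from the usual compact TCAG ordering; translation table 11 assigns the same amino acids as the standard code and differs in which codons may act as start codons. Alternative start codons are not handled here, which is sufficient for this gene only because it begins with ATG. The function name `translate` is illustrative.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order ("*" marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop block spacing and newlines
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGAAGAACT ACGGCATCCC GTTTTTCAGC"))  # MKNYGIPFFS
```

The output matches the first 10 residues of the protein sequence listed above.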