Gene Acid345_0073 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_0073 
Symbol 
ID4068986 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp75499 
End bp76992 
Gene Length1494 bp 
Protein Length497 aa 
Translation table11 
GC content61% 
IMG OID637982073 
ProductDNA uptake lipoprotein-like 
Protein accessionYP_589152 
Protein GI94967104 
COG category[R] General function prediction only 
COG ID[COG4105] DNA uptake lipoprotein 
TIGRFAM ID[TIGR03302] outer membrane assembly lipoprotein YfiO 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTCGTC GTGCTCTCAT AACCGCTGCC ATCGGATTGG CAACGCTGGC AGCCACCGGC 
TGTCACAACA AGAAGGTTTC CAACCCGATC GCCAACATCG ACTCCAAGCA GCCGGACAAG
GTGTTGTATG ACCGCGCGAT GGACGCGATG AAGCACAACA AGTTCGATGT TGCGCGCGTC
ACCCTTCAGA CCCTGATCAA CACCTACCCG GATTCCGAGT TCATCGCCCG CGCCAAGCTC
TCGATCGGCG ATTCCTGGTA CGCCGAAGGT GGATCAGCGG CCATGACCCA GGCCGAAAAC
GAATACCGCG ACTTCATCGT CTTCTTCGGA CAGTCCATGC CGAACGAATC GGCGGAAGCG
CAGATGAAGA TCGCCGGCAT CCACTACGAC GAAATGGAGA AGCCAGACCG CGATTACACC
CACGCCAAGC GTGCGGAAGA AGAATATCGC CAGATGATCC TTCAGTTCCC CGACAGTCCG
CTGGTTCCGA AAGCGAAAAC GCGGCTGCTC CAGGTACAAG AGATCCTGGC GCAACGCGAG
TTCCTCATCG GCAAGTTCTA CATCATGCGC GAGGACTATC CGGCGGCCGT GGCGCGCCTG
CAAACTCTTT CCGACACCTA TCCGTTGTTC AGCGGCTCTG ACGAAGCGCT CTTCTTGCTC
GGCGAAGCAC ACCAGGCAGA AGCGAACCTG GTCCGCAAGG CGTCGCGCCT GGCCGAGACC
CAGCGCGCTA ATGCCATCGC CGGTTTCGAA AAGGACGCTG TCGCCGCCTA TTCGAAAATC
ATCACACGCT ATCCGGCAAC CGATCGCGTA GAAGCCGCGA AGAAGAAGCT CGCCGAACTT
CACGCGCCTA TCCCGACGCC AACGCCGGAA GCCATCGCGC AGAGTAAGGC GGAAGAAGCA
GGCCGCGGGC ATTTGACCAA GAAGTCGGAA ATGATGAGTT TCATGAAGCA CCATCCGGAC
ACTTCTATGG CCACCAATCA TGGCGATCCC ATCATGGTGG ATCCGCCGCA GACCAGCGCC
GTCGACCTCG TACACAACGC CAACGCAACC TTGATTGGAA AGCCTGCTCC CACCAAGGGC
GGCAGCGGTA TTACCCTCGA GACACCGACC GGCACCGGTC CAGCGCCGGA AAACCAGCCG
ATCCCGCGTT CGGACAGCGG CGCGACGGCT CCCACAACCG CAGCTCCCGT GGAAACCGGT
ACGCCGACGC CCACCGCTCC ATCCACGAGC GGCGGGGCCT CCAGCACCAC ACTCAGTGCA
GCTCCTGCCA GCACCGCTAC CGCGGCTCCG ACCAGCGCAC CTGCGACCAA TACGGCAGCA
GCGCCGGCAC CAACCCGTGT GAACGACGCC AATACCGATT CGAGCAACGC GACGACGCAA
TCCACCACCA GTTCTACCGC AGCGCCGGCC GCGCAGGACT CGAGCCAGAC CCCGCAAGAT
TCTTCGAGCA AGAAGAAGAA GAAGCACGGG ATCGGAAAGC TCAACCCGTT CTAG
 
Protein sequence
MFRRALITAA IGLATLAATG CHNKKVSNPI ANIDSKQPDK VLYDRAMDAM KHNKFDVARV 
TLQTLINTYP DSEFIARAKL SIGDSWYAEG GSAAMTQAEN EYRDFIVFFG QSMPNESAEA
QMKIAGIHYD EMEKPDRDYT HAKRAEEEYR QMILQFPDSP LVPKAKTRLL QVQEILAQRE
FLIGKFYIMR EDYPAAVARL QTLSDTYPLF SGSDEALFLL GEAHQAEANL VRKASRLAET
QRANAIAGFE KDAVAAYSKI ITRYPATDRV EAAKKKLAEL HAPIPTPTPE AIAQSKAEEA
GRGHLTKKSE MMSFMKHHPD TSMATNHGDP IMVDPPQTSA VDLVHNANAT LIGKPAPTKG
GSGITLETPT GTGPAPENQP IPRSDSGATA PTTAAPVETG TPTPTAPSTS GGASSTTLSA
APASTATAAP TSAPATNTAA APAPTRVNDA NTDSSNATTQ STTSSTAAPA AQDSSQTPQD
SSSKKKKKHG IGKLNPF