Gene Acid345_4072 details

Gene Information

Locus tag: Acid345_4072
Symbol:
ID: 4072494
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 4821136
End bp: 4822530
Gene Length: 1395 bp
Protein Length: 464 aa
Translation table: 11
GC content: 59%
IMG OID: 637986103
Product: RND efflux system, outer membrane lipoprotein, NodT
Protein accession: YP_593146
Protein GI: 94971098
COG category: [M] Cell wall/membrane/envelope biogenesis
              [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG1538] Outer membrane protein
TIGRFAM ID: [TIGR01845] efflux transporter, outer membrane factor (OMF) lipoprotein, NodT family
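
The coordinates and lengths listed above are mutually consistent, which a short sketch can check. It assumes that the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and do not belong to any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(4821136, 4822530)
print(length, protein_length_aa(length))  # 1395 464
```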


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.153495
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATTGCA AACGAATTCT CCTTGCGCTC CTGTTGCTCC TGATGGCCGG CTGCAAAGTC 
GGACCGAACT ACAAACGCCC CGCCGTGACG GTTCCGGATG CTTACCGGGG GCCGACCTTG
GACGGCGGAC AAGCCAACGG CATCTCCCTG GGCGAGCAGA AGTGGTGGGA TGTCTTCAAC
GATGAGCAAC TGCAAAAGCT GATCCGACAA GCTCTCGATG CTAACTATGA TGTAAAGATC
GCCGCGACTC GCGTGCTCCA GGCACAGGCC GCGCTTGGGA TCACACGCGC CGATCAGTTC
CCCACGATTG CTGGGGGCGC ATCGGCCCTT AACGAGCGTA TCCCGCGGGT GAAAGGCTTG
CCGGCTTACG AAAACAGCGC GCTCCAGGTA AACCTCTCCC TCGTCTGGCA GCTCGACTTC
TGGGGTAAAT ATCGTCGCGC CACAGAAGCA GCGCGTGCAG ACTTGCTCTC CACCGAGTGG
GGCAAGCGCG CCGTCATCAA CAGCGTCATC AGCAACGTCG CCAACGGGTA CTTCCAGTTG
CTTGAGCTCG ATCGCGAGAT GGAGATCGCC AAGGGCACAT TAGCATCGCG CCAAGAATCG
TTGCGCTTGG TGAACATCCG TCAAAAGGGC GGAACGACTT CCCTGCTCGA CGTACGTCAG
TCGGAACAAC TTCTCTACAC GGCCGCCGCT GCCATTCCCG ACCTCGAACG CCGGATCGAG
CAAGAGGAGA ACTTCATCAG CATCCTCCTC GGCCAGAATC CCGGTCCGAT TCAACGGGGC
AAGCCGCTCG TCGAATTCGC GATTCTTCCT TCGGTTCCGC CCGGATTGCC CTCGACTTTG
CTTGAGCGGC GCCCGGACTT GCAGTCGGCG GAGCAGCAAC TGGTCGCAGC GAATGCGCGT
ATTGGTGTCG CGAAAGCTGA CTACTTTCCA CAGATCTCTC TTACCGCCCT CGGTGGATAC
CAGAGCTCGG CGCTAACGGG GCTCTTTTCC GGCCCTGCCG GTTTGTGGAG CTTCGGCGGT
CAACTGGCCC AACCGATCTT CACCGGCGGC AAAATTAGAT CGAACGTGAG ATTAACGGAA
GCTCAGCAAC AAGAAGCGGT GTTCCGCTAT CAACAGTCCA TTCAGCAAGC GTTCCGTGAA
GTCTCGGATT CGCTGGTGGC CTATCGCAAG AACCAGGAGT TCCGCGAACA GGAAGCAAAC
TTGGCGGCTT CTGCCGTGGA TGCTACCCGC CTCGCGCGCA TTCGCTACGA AGGCGGTGTA
TCCAGCTATC TCGAGGTTCT CGATAACGAC ACTCGCTCGT TCGACGCTGA GATCTCGCTT
GCCCAGGCAC AACTCGGCGA ACGCGTCGCA TTGGTCCAGC TCTACAACGC ACTCGGCGGC
GGCTGGCAGC AGTAA
 
Protein sequence
MNCKRILLAL LLLLMAGCKV GPNYKRPAVT VPDAYRGPTL DGGQANGISL GEQKWWDVFN 
DEQLQKLIRQ ALDANYDVKI AATRVLQAQA ALGITRADQF PTIAGGASAL NERIPRVKGL
PAYENSALQV NLSLVWQLDF WGKYRRATEA ARADLLSTEW GKRAVINSVI SNVANGYFQL
LELDREMEIA KGTLASRQES LRLVNIRQKG GTTSLLDVRQ SEQLLYTAAA AIPDLERRIE
QEENFISILL GQNPGPIQRG KPLVEFAILP SVPPGLPSTL LERRPDLQSA EQQLVAANAR
IGVAKADYFP QISLTALGGY QSSALTGLFS GPAGLWSFGG QLAQPIFTGG KIRSNVRLTE
AQQQEAVFRY QQSIQQAFRE VSDSLVAYRK NQEFREQEAN LAASAVDATR LARIRYEGGV
SSYLEVLDND TRSFDAEISL AQAQLGERVA LVQLYNALGG GWQQ
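
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is a minimal, dependency-free illustration, not the pipeline that produced this record. It relies on the fact that translation table 11 assigns the same amino acids as the standard code and differs from it only in the permitted start codons. It also assumes the sequence is given 5' to 3' on the coding strand, as printed here. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino-acid assignments in TCAG codon order (standard code, shared by table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in tens."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three ten-base groups of the gene sequence above.
print(translate("ATGAATTGCA AACGAATTCT CCTTGCGCTC"))  # MNCKRILLAL
```

Passing the full gene sequence to `translate` and `gc_percent` allows a comparison with the protein sequence and the GC content reported in the record.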