Gene Acid345_0189 details

Gene Information

Locus tag: Acid345_0189
Symbol:
ID: 4073076
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 200211
End bp: 201365
Gene Length: 1155 bp
Protein Length: 384 aa
Translation table: 11
GC content: 57%
IMG OID: 637982189
Product: PilT protein-like
Protein accession: YP_589268
Protein GI: 94967220
COG category: [R] General function prediction only
COG ID: [COG4956] Integral membrane protein (PIN domain superfamily)
TIGRFAM ID:
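
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. Both are assumptions about the record's conventions, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 200211
END_BP = 201365
REPORTED_GENE_LENGTH = 1155      # bp
REPORTED_PROTEIN_LENGTH = 384    # aa


def gene_length(start_bp, end_bp):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len):
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


computed_gene = gene_length(START_BP, END_BP)
computed_protein = protein_length(computed_gene)
print(computed_gene, computed_protein)  # prints: 1155 384
```

Under these assumptions the computed values agree with the reported gene length (1155 bp) and protein length (384 aa).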


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.919011
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.108037
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
TTGGATCTGG CAATCATTCG TCTAGTTTTC GTGGTTCTTA GTGCGCTGGC ATGCTTTGAG 
TTAAAGCCTT TTGGATTGAT GGGCTACGAG GCCGCTGGCG TTGGTGTTCT CATCGGAATC
GCCGTGGTCG TGTTCGAGAT GCGGCTGCGT AATGCCAGCC TGAAGCGGCT GATCGGCGCG
GTGATCGGCA GCATTCTTGG CATCTGCGGC GCCTACCTGT TTAGTCTCGT CATTCGCGAC
GCCATTAAAG AGGGGCCGAC GCAGCATTTC CTCGAGCTCT TCGTGATGCT CCTGATGGCG
TATGTCGGCT TGGTGATTGG CGCCGGCAAG GGCGATCTGC TCAATCTTGC GGCGCTGGGC
GGAATCTTTG GCGGCGAGAA ACAGTCGAAG CGCAGCTACA AGATTCTGGA TACCTCGGTC
ATTATTGACG GGCGAATTGC CGACATCGCG GAGACGGGCT TCCTTGATGG AACGATTGTG
ATTCCGCAGT TCGTGTTGCG CGAGTTGCAA CTGGTGGCGG ATTCGGCGGA CTCGCTGAAG
AGGAACCGTG GACGTCGTGG GTTGGACATT CTGCAACGCG TGCAGAAGTT GACCAATTTG
CACGTACAAA TTGTGGAAGA CGATTTCCCG GCAGTCCGCG AGGTCGACCT AAAGTTGATT
GAACTCGCCA AGGTGTATGA GGGGAAGATC GTCACCAACG ACTTCAACCT GAATAAGGTG
GCACAGCTGC AGGGCGTTGA GGTGCTGAAC ATCAATGAGC TGGCCAATTC GCTGAAGCCG
ATCGTGTTGC CGGGCGAGCT GATGCGAGTG TTCATCCTGA AAGAAGGCAA GGAGTACAAC
CAGGGCGTGG CGTATTTGGA TGACGGCACC ATGGTTGTGG TGGACAACGC GCGGAAGATG
ATCGGGAAGA CGATCGAGAT TTCGGTTACA TCGGTGCTGC AGACGACTGC GGGCAAGATG
ATCTTCGGCA AGTTTGATGA GCGGGCCTCA GGCACGATGC CGCGCGCGGA GCAGAGGCCG
GAGCGTCAGG ACTTGCGGAA GAGCAATCCG CAGCCGTCGT TCCCTGCTGC CAATGGAACT
ACCAGCACAC CGCCGAGTTC GCCTGGGATT CCGGCAGCGT CTTCGCCGGG TACGGGCGGT
ACGGGCACGG AATAG
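
The GC content reported in this record (57%) could be recomputed from the gene sequence with a short helper such as the one below. This is a minimal sketch: `gc_percent` is a hypothetical name, and the helper assumes the sequence contains only the letters A, C, G and T, with the spaces and line breaks used above to group bases into blocks of ten ignored. The demonstration uses only the last line of the gene sequence, not the full gene.

```python
def gc_percent(dna):
    """Percentage of G and C bases in a DNA string, ignoring whitespace."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = seq.count("G") + seq.count("C")
    return 100.0 * gc_count / len(seq)


# Last line of the gene sequence above (15 bases, 9 of them G or C).
last_line = "ACGGGCACGG AATAG"
print(round(gc_percent(last_line), 1))  # prints: 60.0
```

Passing the complete gene sequence as one string would give the figure to compare against the reported GC content.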
 
Protein sequence
MDLAIIRLVF VVLSALACFE LKPFGLMGYE AAGVGVLIGI AVVVFEMRLR NASLKRLIGA 
VIGSILGICG AYLFSLVIRD AIKEGPTQHF LELFVMLLMA YVGLVIGAGK GDLLNLAALG
GIFGGEKQSK RSYKILDTSV IIDGRIADIA ETGFLDGTIV IPQFVLRELQ LVADSADSLK
RNRGRRGLDI LQRVQKLTNL HVQIVEDDFP AVREVDLKLI ELAKVYEGKI VTNDFNLNKV
AQLQGVEVLN INELANSLKP IVLPGELMRV FILKEGKEYN QGVAYLDDGT MVVVDNARKM
IGKTIEISVT SVLQTTAGKM IFGKFDERAS GTMPRAEQRP ERQDLRKSNP QPSFPAANGT
TSTPPSSPGI PAASSPGTGG TGTE
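
The protein sequence can be related back to the gene sequence by translating it with translation table 11 (the NCBI bacterial, archaeal and plant plastid code). That table assigns the same amino acids to codons as the standard code but accepts additional start codons, which is why the initial TTG of this gene corresponds to methionine (M) in the protein. The sketch below is a minimal standard-library illustration; `translate_cds` is a hypothetical helper, and it assumes the input is a complete or in-frame stretch of the CDS containing only A, C, G and T.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons accepted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna):
    """Translate an in-frame CDS; a valid first codon is read as M."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for index in range(0, len(seq), 3):
        codon = seq[index:index + 3]
        if index == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is translated as methionine
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":    # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence above (60 bases, 20 codons).
first_line = "TTGGATCTGG CAATCATTCG TCTAGTTTTC GTGGTTCTTA GTGCGCTGGC ATGCTTTGAG"
print(translate_cds(first_line))  # prints: MDLAIIRLVFVVLSALACFE
```

The output matches the first 20 residues of the protein sequence listed above.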