Gene Acid345_1415 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_1415 
Symbol 
ID4068756 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp1713002 
End bp1715617 
Gene Length2616 bp 
Protein Length871 aa 
Translation table11 
GC content60% 
IMG OID637983424 
ProductTonB-dependent receptor 
Protein accessionYP_590491 
Protein GI94968443 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.810305 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0488884 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGTCTAC TCATAGTTCG TTGCCTATTT ATTTCTGCTT TATTCCTCGC CTCAACTGCC 
CTGCTTGCCG GTGATTCTGC GCCCGGCGAT CCACAATCGT CCGGCGGAAA TACCGGAGCG
ATCGAGGGCA CGGTTGCCGA TCCTACCGGC GCCGTCATTC CGAACGCCAC GGTCACGATC
AAAAACCCGG TGACCGGCTA CACGGCCAAG GCGGCGACCG GCAATGACGG CAGCTACATT
TTCCGCAACG TGCCGTTCAA CAACTACCAC GTGACTGCCG AGGCGAAGGG ATTCACCGCG
GCTGTTGCCG ACGCGGAGGT TCGCTCCGGC GTACCGTTTG CGATGAATCT GACGTTGCCG
ATCGCTGCGG CGGCCACGAC GGTCGACGTG CAGGGCGATG CCGGTGACCT GGTTGAGACC
GATCCTGTCT CGCACACCGA TGTCGATCGC GGACTGATCG ACAAGTTGCC GGTAGAAGGC
TCGTCGTCAC AGCTCAGCTC GGTCATCACG CTGTCTACGC CGGGCATTAC TGCCGACTCG
AACGGCCAGT TCCATCCGCT GGGCGAGCAC GCAGACACCT CGTTCTCGCT CGACAACCAG
CCGATGACGG ACCAGCAGAG CAAGGTCTTC TCTAACCAGA TTTCGACCGA TGCGATTCAG
TCCATGGAAG TAATTTCGGG TGTAGCTCCG GCAGAATTCG GCGATAAGAA TTCGCTCGTC
GTGCGCGTCG CTACGCGCTC GGGTCTTGGC CTGAAGCAGC CGACGGGTTC GATCTCGACC
ACGTATGGTT CGTTCGGAAC CAGCACCACC TCGTTCAATA TCCTGCAAGG CAACGACAAG
CTTGGAAACT TCTTCTCTGT TAGCGGACTC AACAGCGGAC GCTTTCTCGA CACGCCGGAA
TTCATGCCGC TGCACGCGCG CGGGAATTCG CAGAGTGGTT TCGATCGCGC CGATTGGCAG
GCAGGCACCG CAGATGTGCT GCATCTGAAT CTTGGCTTCA CGCGCTCGTG GTTCCAGATC
CCGAACAGCT ACGATCAGCA GTTCGCGCAG GAGACGCCGC AGGACCAGCG CCAGGAGATC
AAGAGCCTGA ACGTTTCTCC GGGCTGGACC CACACTTTCA ACAACAACAC ATTGCTCGCG
ACCACGGCAT GGTTCCGCCA GGACCAGGTT GGCTACTACC CGAGCGACGA TATCCTCGCG
GATCAGCCGG CAACGCTGAG CCAGTCGCGA CGGCTGACGA ATACCGGCAT CAAAACCGAT
GTTTCGTATG TGAAGGGCAT TCACAACTTC AAAGCTGGGG TGCAGTTCGA GCACACGATT
CTCGGTGAGA GCTTCGGCTT TGGTTTGACC GATCCGCTTT ACAACGCGCT TTGCGTGGAT
TCGTCAGGCT TGCCGGTGGT GGCAGCGGGA GTCATGAATC CGGGGGCGTG CGCGGGCTTC
TCGACCGGCT ACGCTCCGAA TCCTGGCTTC GATCCGAACT TGCTTCCCTA TGATCTGACG
CGCAGCGGAA TGCTGTTTAA CTTCGTCGGA CACGCCGATG TGAAGGAAGA GTCCATCTAC
GCGCAGGACG CCATCACGCT TGGCAAGTGG GTGCTGAACC TCGGCGTTCG CGGCGACAAC
TACAACGGCA TTTCGTCGGG CCACTTGCTG CAGCCGCGAC TCGGCGTCGC GTACAACGTC
AACAAGACGC ACACCGTTTT GCGGGCATCG TTTGGGCGCT TCTTTGAAAC GCCCTACAAC
GAGAATCTGG TGCTGAGCAG CGCAACCGGG GCGGGCGGCC TTGCACAAGG CGGCGAAGCG
ATTCCGATCC AGCCAGGCCA TCGCACGCAA TACGACGCTG GATTGCAACA GGCAATCGGC
AAGTGGGCCG TGGTGGACGC GGAGTACTTC TGGAAGTTCA CCAAGAACGC CTACGACTTC
GACACGCTCT TCAACACGCC GCTGGCGTTC CCGATCGAGT GGAAGCAGGC GAAGATTGAC
GGCTTCTCCG CGCGCGTCAC CTTCCCGACG TACAAAGGCG TGACCGCGTA CACCGTGCTC
AGCCACACCC GGGCGCGATT CTTCCCGCCG GAGAACGGCG GCTTGATCTT CAATTCCGAT
CTGAGCACCA CGCCGTTCCG GATTGACCAC GACCAGGCGT TCGGCGCTTC AACCAACGTT
CAATACCAGC CGAAAAAAGA CGCGCCGTGG ATCTCCTTCA CCTGGCGCTA TGACAGTGGC
GAGGTTGCGG GTGCGATTCC GGATTTCGCG ACCGCACTGA CGCTTACCGG CGACGAGCAG
GCGCAGATGG GGCTCTTCTG TGGCGACGTC TTTGCGGCTC CGGGAGCGCC GATTCGTTCG
TGCGCTGCGG GCATCGGCGC AACGCGGGTG GTGATTCCGG CGGCTGGAAC CTATGACGCG
GACAAGAACC CGGCGCGCAT CGCATCGCGC AACGTCCTCG ACATGGGGAT TGGCTGGGAC
AATATCTTCC ACGCCGACCG TTATAAAACT GCGGTCAGTT TCACCGTTGC GAACCTGACG
AACAAAGACG GGCTGTACAA CTTCCTGTCC ACATTCAGCG GGACACACTT CATCCCGCCG
CGGTCGTACA CGGGACAGGT GAGCTGGCAC TTCTAA
 
Protein sequence
MRLLIVRCLF ISALFLASTA LLAGDSAPGD PQSSGGNTGA IEGTVADPTG AVIPNATVTI 
KNPVTGYTAK AATGNDGSYI FRNVPFNNYH VTAEAKGFTA AVADAEVRSG VPFAMNLTLP
IAAAATTVDV QGDAGDLVET DPVSHTDVDR GLIDKLPVEG SSSQLSSVIT LSTPGITADS
NGQFHPLGEH ADTSFSLDNQ PMTDQQSKVF SNQISTDAIQ SMEVISGVAP AEFGDKNSLV
VRVATRSGLG LKQPTGSIST TYGSFGTSTT SFNILQGNDK LGNFFSVSGL NSGRFLDTPE
FMPLHARGNS QSGFDRADWQ AGTADVLHLN LGFTRSWFQI PNSYDQQFAQ ETPQDQRQEI
KSLNVSPGWT HTFNNNTLLA TTAWFRQDQV GYYPSDDILA DQPATLSQSR RLTNTGIKTD
VSYVKGIHNF KAGVQFEHTI LGESFGFGLT DPLYNALCVD SSGLPVVAAG VMNPGACAGF
STGYAPNPGF DPNLLPYDLT RSGMLFNFVG HADVKEESIY AQDAITLGKW VLNLGVRGDN
YNGISSGHLL QPRLGVAYNV NKTHTVLRAS FGRFFETPYN ENLVLSSATG AGGLAQGGEA
IPIQPGHRTQ YDAGLQQAIG KWAVVDAEYF WKFTKNAYDF DTLFNTPLAF PIEWKQAKID
GFSARVTFPT YKGVTAYTVL SHTRARFFPP ENGGLIFNSD LSTTPFRIDH DQAFGASTNV
QYQPKKDAPW ISFTWRYDSG EVAGAIPDFA TALTLTGDEQ AQMGLFCGDV FAAPGAPIRS
CAAGIGATRV VIPAAGTYDA DKNPARIASR NVLDMGIGWD NIFHADRYKT AVSFTVANLT
NKDGLYNFLS TFSGTHFIPP RSYTGQVSWH F