Gene pE33L466_0221 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagpE33L466_0221 
Symbol 
ID3399966 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus cereus E33L 
KingdomBacteria 
Replicon accessionNC_007103 
Strand
Start bp223319 
End bp226396 
Gene Length3078 bp 
Protein Length1025 aa 
Translation table11 
GC content35% 
IMG OID637660053 
ProductTraG/TraD family conjugal transfer protein 
Protein accessionYP_245717 
Protein GI67078097 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3505] Type IV secretory pathway, VirD4 components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGGAGAATG CAGTTCAAAA GAATGCTGAA CTCAATAATG CACCAGGTAA TGGTGAGGAT 
CCAAAAGCCG GAAAGAGCAA AAAAAATCCC TATACAGTTA CAATCGACAT TACAAAAAGC
TGGAAATATC GTTTAGCAGC CCGTTCATCT TACTATGTTG TGACATTACT ACTTTTGCTA
GCTTGGTATT TTGCTTTTAG AGGAATATAT AAATACGCTT CTTTCTGGTC AATACAAATG
GGCGCTCCAT TGCCTACTTT AAGTATCTTC GGTAATCCGT TTGAAACTAA TTATTACCTT
GAAACCTTTT TTAGCTTTTG GATTTTAATT TATACTTGGC AACTATCAAC CAAGGTTAAG
ACCTACCAGG AGATAAAGCG TAGATGGTTC ATATTCTCAA TGATGTTAAT TGGTATAGTT
GCCCAGTACT TATGGGCGTT TAGTGTTCCA TTAGCTAAAA TATTTATTCC ATTCTTAAAT
TCGAAAGCTG GAGAAGTTTC ACTAAATGAC AAAGGATTAG AGGATACTAT ACTTTCAAAT
TTTAGTAATA TCATGCACCT TGTATTTGCA ATACCTGTAA TTATTATTAT TCTCGTACTC
TTATGGCTCT TTAAGATTTT TTATGAACAT AAAAAAGAGC TTTTAGATGA ATTTGGAAAA
TGGGAATATG TGTTTAAAAT ACCTGGTTGG ATGCTTTCCT TTTTGAATGA TGATCGTCAA
CGAAAACTTG CTACCTCATT ACACAGATTT TTCACTGAAA AAAATCCATC AAAATTGCCA
GAACCCGATA TCTTTCTTGG CCCTAACAGT GCTACACGGG AAATGGCAGT AATACAGGGC
AAGTCTTTAA CACTTAATAT TATGATCATT GGTAACATTG GTACAGGTAA ATCAGCTGCA
CTTGGATTAC CAATTGCCAA TCAAATTCTT GACTATATGG CATCAATGAT TAATAACTTT
AAAACTTTAT ATGCAAGAAA AGATTATCAT TCAGAGGATG TAAAAGGAAC GCAGGTTAAT
GGTCTTACAG TTATCGAGCC ATCTAATGAC TTCTGCGAAA AAGTATATAA ATTGGTATTA
GCACACAAGA TACCCGAAAG TGTTATTTTC TATTTAGACC CAACAAATCC TGATACCCCT
TCAATAAATT TTGTACGTGG TCCAGTTGAT AAAGTTGCTG AGATGTTATG TTCAGTATTA
ACGGGTCTCT CTGATAATGG AGCAGGAAAT CCGTTCTTCG TTCAATCTGA GCGTTCCCAT
CTTAAACAAC ATATTTATTT GTTAAAACTC CATGATTCTA GTTTTGAAGC TAGATTTGAG
CATTTAATTG ATATGTATAA TGATGCTAAT CTCGTTTTTG AGATGCATCT CAAATTAAAA
AAACGACTTC CAAATGATAT TGAATCTATT CCAGATAGAG ATGAACGAAA CCATTGGCGC
ATAATGAAAC AGGTAGATGA ATGGTTTGAT TTAAATTATG TTCCTGAAAT TTCTGGTGGT
CGTGGGGGAG GGGAAGTTGT TTATCATACC GAAGGGAAAT ACTATGGTCA ACCAAAGATA
ATTGATAAAC AAGAAACCTT TGTTCGCGGT TTACGAAACA CATTAAATGA TATTTCTGCT
CAACCTCTGC TCCGACGTGT TTTATGTGGC CCATCGGATT TTGATTTTGA GAAACACCTG
GAATTTGGAG GTATCCTTTT AGTCAATACA GCTAAAGGGG AACTTTCTGA TCTATCTGAT
GTATTCGGTA AATTATGTTT ATATGCCGTG CAGAATGCTG TTTTCCGTCG TAAACCTAAC
GTATCTCCTT ACCATCCTGT TTTAGTCGAT GAGTTTGCGG ATTACATTTA TAAAGCATTT
AAAAGTTTTC CTGCTCAATC ACGTAAATAT AAAACACCTT TAATTGTTAT TGCGCAAACA
ATTAGCCAAT TAGCAATTGA ACATGGACCA AGATTTATGG ATATCCTATT GGGGACATTT
CGTAACAAAC TTGTTTATGG TGATGTTACT AATGAAGACG CAAAATTGTT TAGTAAGCTA
ATGGGTACGA AAACAATTTA TGAAGCACGT GAAGGTGATC AAGAAATTGA TATGGTAACA
GCAGAAACAA AAACACAAAG TACTCGTAGA ACATCCTACT CTTATTCGAA AACAGAGGTA
CCCATTCTTT CTGAAAATGA CATTTTAATT CAAAAAGCAT TCCAGTGTGC TGCTAAAATT
GTAAAAGATA ATGCTCCTCA AGGAGGAATA CAAGTAAATG CTAACTTTGT TCCTGCCTCT
GAGTTTAAAA CAGCTAAGAT TCAAGTTGAT GCAGAAGCAG CTGCTTATTG GTTAAAAATT
CGTGAAGAGT CATTGAATGC GGAATTTAAA TATGATGATT ATGTCACAGA TACAAATGTT
GATAGTGAAG AAACTGTAAT TGAAGAATTG AGTACACCTA AAACAGCACC TACTGCAATA
GAAGGTTGGT TACAAATGTT AAATCCTGAT ACTTACGATT TTTATAGTAA TGAAGATGAT
ACTGACAATG AAGGGGAGTT AACAAATAAT CAGTCTGTAC AAGATTACTC AAGAAATTCT
GCTCCTAAAC TAGAGAAGGA GGACGCTACA GCATCAATTA CTACTTCTAC TACAACAGAA
CCCGCAAAAA TTAATGAAGT AATCAATGTT CAAAAAGAAA ATGTACCAGC CACAAGTAAA
CATAGAGAAC CTATTGAAAT TAACAACAAA GCATCAGTTA TTACTGAAAC TCCAATAGAA
GTGCCAAGGG CGAAGCATAC ACCACAATAT AGAGAGCCTG CTTACACATC GACTGTAACT
CATAATCAGC AGGAAAAGGA AATTCAACGA ACTACAGAAA TCAATGGTAA AAATACAACA
TCTAACTTAA CAATGCAATG GTTACAGCAA CAAATGGCTG CAACTGTATC TAAAGATGAT
GAAAACACAT TATCTTCAAA TACAACAGAA GAAGGTTCTC GACTTCAAAA TCGAAAAGTG
GCAGAAGTAA CTCCAGAAAG TGCGGCAATT AAGGATAAAC TCATTCAATA TATCGAAGGT
GACATTAATC ATGATTGA
 
Protein sequence
MENAVQKNAE LNNAPGNGED PKAGKSKKNP YTVTIDITKS WKYRLAARSS YYVVTLLLLL 
AWYFAFRGIY KYASFWSIQM GAPLPTLSIF GNPFETNYYL ETFFSFWILI YTWQLSTKVK
TYQEIKRRWF IFSMMLIGIV AQYLWAFSVP LAKIFIPFLN SKAGEVSLND KGLEDTILSN
FSNIMHLVFA IPVIIIILVL LWLFKIFYEH KKELLDEFGK WEYVFKIPGW MLSFLNDDRQ
RKLATSLHRF FTEKNPSKLP EPDIFLGPNS ATREMAVIQG KSLTLNIMII GNIGTGKSAA
LGLPIANQIL DYMASMINNF KTLYARKDYH SEDVKGTQVN GLTVIEPSND FCEKVYKLVL
AHKIPESVIF YLDPTNPDTP SINFVRGPVD KVAEMLCSVL TGLSDNGAGN PFFVQSERSH
LKQHIYLLKL HDSSFEARFE HLIDMYNDAN LVFEMHLKLK KRLPNDIESI PDRDERNHWR
IMKQVDEWFD LNYVPEISGG RGGGEVVYHT EGKYYGQPKI IDKQETFVRG LRNTLNDISA
QPLLRRVLCG PSDFDFEKHL EFGGILLVNT AKGELSDLSD VFGKLCLYAV QNAVFRRKPN
VSPYHPVLVD EFADYIYKAF KSFPAQSRKY KTPLIVIAQT ISQLAIEHGP RFMDILLGTF
RNKLVYGDVT NEDAKLFSKL MGTKTIYEAR EGDQEIDMVT AETKTQSTRR TSYSYSKTEV
PILSENDILI QKAFQCAAKI VKDNAPQGGI QVNANFVPAS EFKTAKIQVD AEAAAYWLKI
REESLNAEFK YDDYVTDTNV DSEETVIEEL STPKTAPTAI EGWLQMLNPD TYDFYSNEDD
TDNEGELTNN QSVQDYSRNS APKLEKEDAT ASITTSTTTE PAKINEVINV QKENVPATSK
HREPIEINNK ASVITETPIE VPRAKHTPQY REPAYTSTVT HNQQEKEIQR TTEINGKNTT
SNLTMQWLQQ QMAATVSKDD ENTLSSNTTE EGSRLQNRKV AEVTPESAAI KDKLIQYIEG
DINHD