Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BTH_II0769 |
Symbol | |
ID | 3845979 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia thailandensis E264 |
Kingdom | Bacteria |
Replicon accession | NC_007650 |
Strand | + |
Start bp | 896846 |
End bp | 899482 |
Gene Length | 2637 bp |
Protein Length | 878 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637838072 |
Product | type IV pilus biogenesis protein PilN, putative |
Protein accession | YP_438966 |
Protein GI | 83716256 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG1450] Type II secretory pathway, component PulD |
TIGRFAM ID | [TIGR02520] type IVB pilus formation outer membrane protein, R64 PilN family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.704016 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATGGAA TGGGCGGCGC ATTGCGCGCG GCGAGTGGGA AATTGAATGG CCTATTGAAT TTTGCACGTC AACTAGCTTT TGTCAAACTG TTCGGGGTAT ATTTGCCGCG CAATAATCAA TTCAATAATG CGACTGCGTG CGCAGACCTA TTATGTAAAG CCGAGGTAAT CCCGGGTTCG GAAAGTCATT TCCGGCCGCG AAGGCAGCCC GATGCGTCGG GCGCGGCATC CGGGATCGCC GGGCGGCAAA GCGCGGGGGC CAGCGGCGCA CGGCAGCAGT CGAGTAACGA GGAAAATCAT GATGCACGGC GCCACGGTCG CCGCGGCCGC GTTCATCGCG ATCGCGCTTT TCGCAACCGA ATGCGCGGCG AGCGGTCCGC CCGCCGGCAC CGATCCGGCA TCGGGCGGCT CATTCGCCGA GCGCTTCGAT CACGCGCCGC TCACGGCATC GCGCGTCGAC GTGTTTCGCG CGCCATCGCC GGCGGCGTCG GCCGGGGCCG CCCGGGCCGC GGACACGATG CCCGCGGCGG CGATGCCGGC CGACGCCGAC GCGACGGCCG CGCGATCCGC GCCGCAGCCG GTTGCGGACA TGCCCGTGCG GGACGTGCCC GTGCGGGACA TGCCCACGTG GGACGTGCGC GCGTCCGACG GCACGATTCG CGGCGTTTTG TCGCGATGGG CGAATACGGC GGGCTGGCAG CTCGTCTGGG ATGCGCCCGT CGATTTCAGC ATCGATGCGC AGGCGACGCT GCGCGGCTCG TTCGAGGATG CGCTGCAGGC GCTCGTCGCG AGCCTCGGCC GCACGTCCGC GCCGATCCAG GCGATTCTCT ATCGAGGCAA TCACGTGTTG CGCGTTGTCG CGCAAGGAGC GGGCTGATGC GCGTATTGTT CGCTATTTCA TTGCTGGCCG TCGCATGGCT TGCCGGCTGC ACCGGGCTTC GCGGCGGCAT CGAGCGCGAC GCGCGACGCG AGTCGAACGA ATCCGCGGCG CTCTTCAAGC GCGCGTCCGA CGGCGACAAC AGCGTGCACG CGCTTGCGCC CGTCGTCGTC GACGATGGGC TGTGGGTGTC GGCGGGCGCC GTCAAGCTGC GCAGCGGCGA GCAGTTGCCG CCGCTGTTCG ACGAGCCGGC GTCGTTCGAT CGCGCGGTCT CTTCGTTGTC CGAATTTGCC GAGCAGATCA CGCGGTTGAC CCAGGTGCCG ACGCAGGTCG CCGCGAGCGC GCAGCAGGCG GCCGCGCGTT CGCAGCAAGG CGGCGGCGCG GACGGCGCGG CGCGCAGCGC GCCCGTGTTT CTCGACGCGG CGGGCGCTCG CTCGACGCCG CCGCTGCCGC CCGGCATGCC GGGCGGCGCG TCGGCCGGCG GCGGCAAGGC GGGCGGCAGC GGCGCGGGCC CGGACGGCGG CGCCGGCACG TCGTTCGCGC CGGTGCGCAT CGTGTACGCG GGCGGCACGC TGCGCGGCCT GCTCGACGCG GCGTGCGCGC GCTTCGGCGT CTTCTGGAAA TACGAGCAGG GGACGATCCG CTTCTTCTTC ACCGATACCC GCACGTTTCA GGTCAACGCG ATTCCGGGCG ATTCGTCGCT GAACGCGTCG GTCGTGAGCG GCGCGACGAG CGACGGCACC TCGGGTGGCT CGCAGTCGGG CGGCTCGGGC GGTGGCATCA ACGGCGCGAG CGGCGGCACG ACGGGCCTCA CCGCGAACAA CACGGCGAAC ACCGCGGTGA ACTCTCAACT GTCGGTGTTC AACGGCCTGC AGAGCGCGAT CCAGTCAATG CTGTCGCGCT ACGGCAGCTC GGTCTCGTCG CCCGCTACCG GCTCGATCTC GGTGACCGAC ACGCCCGACG TGCTCGAGCG CGTCGCCGCG TTCATGACGC AGCAGAACCG CTCGCTGTCG CGGCAGGTGC TGCTCAACGT GACCGTGCTC AGCGTGTCGC TGAAGGCGGG CGACGCGTAC GGGATCGACT GGAGCCTCGT CTACAAGACG ATGTCGGCGG GATTCAACAT CACCAATCCG TTCAATCCGG TGTCGCTGAC GAAGCCGGCC GATCTGTCCG CGACCGTGCT CAGCCCGACG AGCCGCTTCA ACGGCACGAA GCTGCTGATT CGCGCGCTGT CGCAGCAGGG GACGGTGCGG CGCAAGACGT CGGCGTCGGT GACGACGCTC AACAACCAGC CGGTGCCCGT GCAGGTCGCG ACGCAGACGG GCTACCTCGC GTCGGTGTCG ACGACGAACA CCGCGAACGT CGGCTCGTCG ACGGCGCTCA CGCCGGGCGT CGTGACGACG GGCTTCAACA TGACGCTGCT GCCGCACGTG CTCGACGACG GCACCGTGAT GCTGCAGTTC TCGACGAACA TCTCGTCGCT GCTCCAGTTG AAGGAGGTGT CGAGCAGCAC GGGAGGGCGC AGCGCGGCGC GTATCCAGAC GCCCGACGTC GACATGCGCA ACTTCCTGCA GCGCGTTGCG ATGAAATCGG GCGAGACGCT CGTGATCAGC GGCTACGAAG GCGCGAACGA TTCGCTCGAC GAGCGCGGCG TGGGCACGCC GAAGATGATC GCGCTCGGCG GCGGCTACGA GGCGCAGCGT GCGCGCGAGG TGATCGTGAT CCTGATCACG CCCGTCACGC AGCGCGGCGG CGCGTAA
|
Protein sequence | MNGMGGALRA ASGKLNGLLN FARQLAFVKL FGVYLPRNNQ FNNATACADL LCKAEVIPGS ESHFRPRRQP DASGAASGIA GRQSAGASGA RQQSSNEENH DARRHGRRGR VHRDRAFRNR MRGERSARRH RSGIGRLIRR ALRSRAAHGI ARRRVSRAIA GGVGRGRPGR GHDARGGDAG RRRRDGRAIR AAAGCGHARA GRARAGHAHV GRARVRRHDS RRFVAMGEYG GLAARLGCAR RFQHRCAGDA ARLVRGCAAG ARREPRPHVR ADPGDSLSRQ SRVARCRARS GLMRVLFAIS LLAVAWLAGC TGLRGGIERD ARRESNESAA LFKRASDGDN SVHALAPVVV DDGLWVSAGA VKLRSGEQLP PLFDEPASFD RAVSSLSEFA EQITRLTQVP TQVAASAQQA AARSQQGGGA DGAARSAPVF LDAAGARSTP PLPPGMPGGA SAGGGKAGGS GAGPDGGAGT SFAPVRIVYA GGTLRGLLDA ACARFGVFWK YEQGTIRFFF TDTRTFQVNA IPGDSSLNAS VVSGATSDGT SGGSQSGGSG GGINGASGGT TGLTANNTAN TAVNSQLSVF NGLQSAIQSM LSRYGSSVSS PATGSISVTD TPDVLERVAA FMTQQNRSLS RQVLLNVTVL SVSLKAGDAY GIDWSLVYKT MSAGFNITNP FNPVSLTKPA DLSATVLSPT SRFNGTKLLI RALSQQGTVR RKTSASVTTL NNQPVPVQVA TQTGYLASVS TTNTANVGSS TALTPGVVTT GFNMTLLPHV LDDGTVMLQF STNISSLLQL KEVSSSTGGR SAARIQTPDV DMRNFLQRVA MKSGETLVIS GYEGANDSLD ERGVGTPKMI ALGGGYEAQR AREVIVILIT PVTQRGGA
|
| |