Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Vapar_5125 |
Symbol | |
ID | 7971496 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Variovorax paradoxus S110 |
Kingdom | Bacteria |
Replicon accession | NC_012791 |
Strand | + |
Start bp | 5439633 |
End bp | 5441261 |
Gene Length | 1629 bp |
Protein Length | 542 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 644795719 |
Product | protein of unknown function DUF894 DitE |
Protein accession | YP_002946993 |
Protein GI | 239818083 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGCCGA TTTCGCCCGA AGAAACACCC ATTCCCCCCA TCCGGCCGCA GCCGCCGAAG TTCGCGGCGC TCGAGCCGCT CAAGCTGCCC GTGTTCCGCA TGCTGTGGAG CACCTGGCTC ATCGCCAACA TCTGCATGTG GATGAACGAC GTGGCGGCGG CGTGGATGAT GACCTCGCTG ACCACCTCGC CGATCTGGGT GGCGCTGGTG CAGTCGGCCT CCACCCTGCC GGTGTTCCTG CTGGGCCTGC CCAGCGGCGC GCTGGCCGAC ATCCTGGACC GCCGGCGCTG GCTGGTGGCC ACGCAGTTCT GGCTCGCGGG CACGGCCATC GTGCTGTGCG CGGCCATTGC CATGGACCTG ATGACCGCGC CGCTCTTGCT GGCGCTCACC TTCGCCAACG GCATCGGGCT GGCGCTGCGC TGGCCGGTGT TCTCGGCCAT CGTGCCCGAG CTGGTGCCGC GGCCGCAATT GCCGGCGGCG CTGGGCCTGA ACGGCATTGC GATGAACGCC TCGCGCATCA TCGGCCCGCT CACGGCCGGC ATGCTGATCG CCAGCGCGGG CAGCGTCTGG GTGTTTGCGC TCAACGCGGT GCTGTCGGTG GCCTCGGGCT TCGTGGTGCT GCGCTGGCGG CGCGAGCACA CGCCCAATCC GCTGGGCCGC GAGAAGCTCA TCAGCGCGAT GCGCGTGGGC GTGCAGTTCG TCTGGCAATC GCAGCGCATG CGCGCGGTGC TCTCGCGCAT CACCATCTTC TTCTTCCACT CCACCGCGCT GCTCGCGCTG CTGCCGCTCT TGGCACGCAA CCTCAAGGGC GGCGGCGCGG CCACCTTCAC GCTGCTGCTC GCCGCCATGG GCGCGGGCGC GATCATCGCG GTGCTGTTCC TGCCGCGGCT GCGCCAGGCG CTCGGGCGCG ACCAGCTGGT GCTGCGCGGA ACGGTGCTGC AGTCGCTCGC CACCGCGGTG ATGGCCTTCG CACCCAACGC CTGGGTGGCG GTGCCCGCGA TGTTCTTCGG CGGCATGGCG TGGATCACGG TGGCCAACTC GCTCTCGGTC TCGGCGCAGC TCGCGCTGCC CGACTGGGTG CGCGCGCGCG GCATGTCGAC CTACCAGATG GCGATCATGG GCGCGAGCGC GATCGGCGCG GCGCTCTGGG GCCAGGTGGC CACCGTCACC GACCTGCGCT CCAGCCTCGA GGTCGCCGCC GTGAGCGGCA CCCTGCTGAT GCTGGCGGCG CTGCGCTGGG TCACCGACGT CTCGGGCGAG GAAGCCGACA TGAGCCCGGC GCGGGCCGGC TGGGCCGCGG GCCCGCCGGC GGAAACGCCG GAAGAAAACG GCCGCGTGGT CATCACCATC GAATACATGA TCGACCCCGC GCGCGCGGCC GCCTTCCACC TGGTGATGCA CCAGACCCGG CGCGCGCGCC TGGGCCAGGG CGCCATCGGC TGGGAGCTGC TGCACGACAT CGCGGAGCCG GGCCGCTACC TGGAAGAGAT CGTGGACGAA AGCTGGACCG ACCACCTGCG CCGCTTCAAC CGCGCCACGG CCGCCGACAT GGCGCTGCGC GAGCGGCGGC TGGCCTTCCA CATCGGCGAA TCGCCGCCGG TGGTGACGCG CTACGTGGTG AAGCGCTGA
|
Protein sequence | MPPISPEETP IPPIRPQPPK FAALEPLKLP VFRMLWSTWL IANICMWMND VAAAWMMTSL TTSPIWVALV QSASTLPVFL LGLPSGALAD ILDRRRWLVA TQFWLAGTAI VLCAAIAMDL MTAPLLLALT FANGIGLALR WPVFSAIVPE LVPRPQLPAA LGLNGIAMNA SRIIGPLTAG MLIASAGSVW VFALNAVLSV ASGFVVLRWR REHTPNPLGR EKLISAMRVG VQFVWQSQRM RAVLSRITIF FFHSTALLAL LPLLARNLKG GGAATFTLLL AAMGAGAIIA VLFLPRLRQA LGRDQLVLRG TVLQSLATAV MAFAPNAWVA VPAMFFGGMA WITVANSLSV SAQLALPDWV RARGMSTYQM AIMGASAIGA ALWGQVATVT DLRSSLEVAA VSGTLLMLAA LRWVTDVSGE EADMSPARAG WAAGPPAETP EENGRVVITI EYMIDPARAA AFHLVMHQTR RARLGQGAIG WELLHDIAEP GRYLEEIVDE SWTDHLRRFN RATAADMALR ERRLAFHIGE SPPVVTRYVV KR
|
| |