Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noc_3022 |
Symbol | |
ID | 3705769 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | + |
Start bp | 3419251 |
End bp | 3420669 |
Gene Length | 1419 bp |
Protein Length | 472 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 637739496 |
Product | major facilitator transporter |
Protein accession | YP_344994 |
Protein GI | 77166469 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.358015 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGCAAA ATAAAGCAAA ACGCCGATTC ACCTTGGGGA TGACGCTTTT AGAGCGCCGC AGCCTCTTCT CCCTGGCTGG CATCTATTCC CTGCGTATGT TGGGATTATT TCTAATTCTG CCGGTCTTCT CCCTCTATGC CCATGATCTC CAGGGCGCTA CCCCTGCCTT GATTGGCCTG GCCCTGGGCG CCTATGGGAT TACCCAAGCA CTGCTCCAGA TTCCCTTCGG TTTACTCTCT GACCGTATTG GGCGCAAACC GATCATCACT GCTGGCCTGA TCTTATTCGC CCTTGGGAGC ATTGTGGCCG CTATGGCCGA CACCATCGCC GGAGTCATCA TTGGCCGGGC ACTGCAAGGT ACCGGCGCTA TTGCGGCGGC GGTTATGGCG CTGGTGGCCG ATCTAACCCG GGAAGAGCAG CGGACCAAGG CCATGGCTTT AATTGGCCTC TCTATTGGCA TGTCTTTTGC CGTTGCCCTG GCAGCAGGAC CGGTACTCAA CCAGTGGATC GGGGTACCGG GACTGTTCTG GCTGACCGCC ATTCTAGCGG TCTTAGGAAT CGCCGTGCTT CACCTAGGTG TTCCCCAGGT AACAGCACCC CGTCACCACC TGGACGTGGA ACCCGCGCCT CAGCAGTTTC TCCGCGTGCT GGGAGATTTT CAGCTGATGC GCCTAGCGTT GGGAATCTTT TTTCTGCACC TTCTGCTGAC CGCTAGCTTC GTGGTCCTGC CCATTAGTTT ACGGGATGAA AGTGGTCTTG ATCCTGCTTA TCATGGTTAT GTTTACCTGC CGGTATTGGT GACTTCCATC ATCGCCATGG TGCCCTTTAT CATTTTGGCG GAAAAAAAAC GCCGCATGAA AGAAGTGTTT ATTGGCGCAG TAGCGGTGCT GGGCTTGGCG GAATTGGCCT GGCGCTTCTT TCATCCCTCT CTGGCAGGCA CTATCGTTGC TTTATGGCTG TTCTTCACTG CCTTTAATCT GTTGGAAGCC ACCTTGCCCT CTCTGGTCTC TAAGCAAAGC CCCGCCGGAA GTAAGGGTAC CGCCATGGGA GTTTACTCCA CCTGCCAATT TCTGGGGGCC TTTGTAGGCG GCTGGGCCGG CGGAGCCGTT TACGGGTATT TTGGCTTTGA AGGGGTCTTC ACCTTTTGTG CTGGCATCGT AGCCTTGTGG CTAATCTTTG CCGCCACCAT GGAGCCGCCC CAATACTTGC GCAGTCAAAC CCTTTCTATC GGAAAAGTGA ATCCTGATGA GGCTCAGCTT CTGGCGAAAC GCCTTGCCCA AGTCACCGGC GTTGCCGATG TGGTAGTAGT TGCCGAAGAA GGGATAGCCT ATCTCAAAGT GGATGATGAA CGGCTGGATA AAGCTGCTCT TACTGAAATT GGGCCAGAGC AGATGCAATC GACTCAACCT TCAATATAG
|
Protein sequence | MKQNKAKRRF TLGMTLLERR SLFSLAGIYS LRMLGLFLIL PVFSLYAHDL QGATPALIGL ALGAYGITQA LLQIPFGLLS DRIGRKPIIT AGLILFALGS IVAAMADTIA GVIIGRALQG TGAIAAAVMA LVADLTREEQ RTKAMALIGL SIGMSFAVAL AAGPVLNQWI GVPGLFWLTA ILAVLGIAVL HLGVPQVTAP RHHLDVEPAP QQFLRVLGDF QLMRLALGIF FLHLLLTASF VVLPISLRDE SGLDPAYHGY VYLPVLVTSI IAMVPFIILA EKKRRMKEVF IGAVAVLGLA ELAWRFFHPS LAGTIVALWL FFTAFNLLEA TLPSLVSKQS PAGSKGTAMG VYSTCQFLGA FVGGWAGGAV YGYFGFEGVF TFCAGIVALW LIFAATMEPP QYLRSQTLSI GKVNPDEAQL LAKRLAQVTG VADVVVVAEE GIAYLKVDDE RLDKAALTEI GPEQMQSTQP SI
|
| |