Gene Noc_3022 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_3022 
Symbol 
ID3705769 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp3419251 
End bp3420669 
Gene Length1419 bp 
Protein Length472 aa 
Translation table11 
GC content55% 
IMG OID637739496 
Productmajor facilitator transporter 
Protein accessionYP_344994 
Protein GI77166469 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.358015 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGCAAA ATAAAGCAAA ACGCCGATTC ACCTTGGGGA TGACGCTTTT AGAGCGCCGC 
AGCCTCTTCT CCCTGGCTGG CATCTATTCC CTGCGTATGT TGGGATTATT TCTAATTCTG
CCGGTCTTCT CCCTCTATGC CCATGATCTC CAGGGCGCTA CCCCTGCCTT GATTGGCCTG
GCCCTGGGCG CCTATGGGAT TACCCAAGCA CTGCTCCAGA TTCCCTTCGG TTTACTCTCT
GACCGTATTG GGCGCAAACC GATCATCACT GCTGGCCTGA TCTTATTCGC CCTTGGGAGC
ATTGTGGCCG CTATGGCCGA CACCATCGCC GGAGTCATCA TTGGCCGGGC ACTGCAAGGT
ACCGGCGCTA TTGCGGCGGC GGTTATGGCG CTGGTGGCCG ATCTAACCCG GGAAGAGCAG
CGGACCAAGG CCATGGCTTT AATTGGCCTC TCTATTGGCA TGTCTTTTGC CGTTGCCCTG
GCAGCAGGAC CGGTACTCAA CCAGTGGATC GGGGTACCGG GACTGTTCTG GCTGACCGCC
ATTCTAGCGG TCTTAGGAAT CGCCGTGCTT CACCTAGGTG TTCCCCAGGT AACAGCACCC
CGTCACCACC TGGACGTGGA ACCCGCGCCT CAGCAGTTTC TCCGCGTGCT GGGAGATTTT
CAGCTGATGC GCCTAGCGTT GGGAATCTTT TTTCTGCACC TTCTGCTGAC CGCTAGCTTC
GTGGTCCTGC CCATTAGTTT ACGGGATGAA AGTGGTCTTG ATCCTGCTTA TCATGGTTAT
GTTTACCTGC CGGTATTGGT GACTTCCATC ATCGCCATGG TGCCCTTTAT CATTTTGGCG
GAAAAAAAAC GCCGCATGAA AGAAGTGTTT ATTGGCGCAG TAGCGGTGCT GGGCTTGGCG
GAATTGGCCT GGCGCTTCTT TCATCCCTCT CTGGCAGGCA CTATCGTTGC TTTATGGCTG
TTCTTCACTG CCTTTAATCT GTTGGAAGCC ACCTTGCCCT CTCTGGTCTC TAAGCAAAGC
CCCGCCGGAA GTAAGGGTAC CGCCATGGGA GTTTACTCCA CCTGCCAATT TCTGGGGGCC
TTTGTAGGCG GCTGGGCCGG CGGAGCCGTT TACGGGTATT TTGGCTTTGA AGGGGTCTTC
ACCTTTTGTG CTGGCATCGT AGCCTTGTGG CTAATCTTTG CCGCCACCAT GGAGCCGCCC
CAATACTTGC GCAGTCAAAC CCTTTCTATC GGAAAAGTGA ATCCTGATGA GGCTCAGCTT
CTGGCGAAAC GCCTTGCCCA AGTCACCGGC GTTGCCGATG TGGTAGTAGT TGCCGAAGAA
GGGATAGCCT ATCTCAAAGT GGATGATGAA CGGCTGGATA AAGCTGCTCT TACTGAAATT
GGGCCAGAGC AGATGCAATC GACTCAACCT TCAATATAG
 
Protein sequence
MKQNKAKRRF TLGMTLLERR SLFSLAGIYS LRMLGLFLIL PVFSLYAHDL QGATPALIGL 
ALGAYGITQA LLQIPFGLLS DRIGRKPIIT AGLILFALGS IVAAMADTIA GVIIGRALQG
TGAIAAAVMA LVADLTREEQ RTKAMALIGL SIGMSFAVAL AAGPVLNQWI GVPGLFWLTA
ILAVLGIAVL HLGVPQVTAP RHHLDVEPAP QQFLRVLGDF QLMRLALGIF FLHLLLTASF
VVLPISLRDE SGLDPAYHGY VYLPVLVTSI IAMVPFIILA EKKRRMKEVF IGAVAVLGLA
ELAWRFFHPS LAGTIVALWL FFTAFNLLEA TLPSLVSKQS PAGSKGTAMG VYSTCQFLGA
FVGGWAGGAV YGYFGFEGVF TFCAGIVALW LIFAATMEPP QYLRSQTLSI GKVNPDEAQL
LAKRLAQVTG VADVVVVAEE GIAYLKVDDE RLDKAALTEI GPEQMQSTQP SI