Gene Noc_1547 details


Gene Information

Locus tag: Noc_1547
Symbol:
ID: 3705805
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 1716918
End bp: 1718261
Gene Length: 1344 bp
Protein Length: 447 aa
Translation table: 11
GC content: 51%
IMG OID: 637738032
Product: major facilitator transporter
Protein accession: YP_343561
Protein GI: 77165036
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
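
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and do not come from the source database.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values copied from the record: Start bp, End bp.
gene_length = span_length(1716918, 1718261)
protein_length = expected_protein_length(gene_length)
print(gene_length, protein_length)  # 1344 447
```

Under those assumptions the result agrees with the listed Gene Length (1344 bp) and Protein Length (447 aa).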


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAGCATG AGCTACAATC TATTTCTTCA TTGCTTTTTG GTATTGCCAT TGTACTATTG 
GGTTCGGGCC TATTAGGCAC CTTAGTGGGG GTGCAAGCCA ATCAGGAGCA ATTCAGTTCC
ACGGTTATTG GTTTTATCCA ATCTGCCTTT TTTCTGGGCT ATGTGCTAGG AACCTTCCTT
TGTCCCCTTC TAATCAAGCG CGTAGGGCAT ATCCGCGTTT TTGCCACCAT GGCCGCGTTA
GGGTCGGCAA CAGCGATGGG CTTTGCACTT TGGGTCCATC CTCTCTGGTG GGTTCTATTG
CGGATGGTTT TGGGAATTTC GGTAGTAGGG CTTTATATGG TGGTTGAAAG CTGGCTCAAT
GAGCAGTCTT CCCACCATAG CCGGGGCCGG GTGTTCGCCA TTTACATGAG CATTACGTTG
ATGGCCTTGG GGTTTAGTCA GTTTCTTCTT TTAATAGAGG ATAATCATGG CTTTATCCGT
TTTGCCTTGA CCGCCGTGTT GTTTTCCCTA GCCCTGATTC CGGTTGCGTT GACCCAGACG
CTGGAACCAA AACCGATCTC CGCGCCACGC TCGAATCTTA AAGAACTTTA TTTAGTCTCG
CCCCTAGGGG TTGTGGGAGC TTTGGTGGCC GGCCTTGCCA GTGGCGCCTT TTGGGGGATG
GGAGCAGTGT TTGCCCAGAA TATTGGTCTC TCGGTCTCCA GTACCTCCGT GTTTATGAGC
ACAGTTATCT TTGGAGGCGC CCTGCTACTA TGGCCGGTGG GCTATTTATC GGATCGTTGG
GATCGGCGCA GAGTGCTCAT CATGGTTAGC TTTACCAGTG TGGCTAGCGT GTTGGGCGCC
GCCCTTGTTT TAGATGCTTC AACGCCGATG CTGCTGTTGC TTGCCTTTCT TTACGGGGGG
GTTTCTTTTT CCGTTTATGC CCTGGCCGTG GCTCACTTAA ACGATCACCT TAAGCCTGGG
GAAGTACTAG AAGCGACTCG GGGGATTCTG TTAGTTTATG GGGCTGGTTC CGCTCTGGGG
CCCTTGATTG CTGGTTTTTG CATGGCGGTT TGGGGTCCCT CCGGTTTACT AGACTATTTA
GCGGCTATTT TGGCGTTGCT CGGGCTGTTT GGCCTTTACC GCACCCAGCG GAGTGCTCCC
ATACCGGCTG AAGAACAGGG GGAATTCGTT CCCATGATAC GAACTTCTCA AGCTGTCCTT
GAAATGTATC CAGAGGCCGA TCTGGAGCCA GAATTGGACT TAGCGTTGAG TACTGATTTT
GAGGAAGAAG CAGAGCCTGA ATCCCCGCCG GATTCTTTTA GCATGGACTG GGACTCTCCG
GATTATGAGC AAGAGAGAAA ATAG
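
The gene sequence above is printed in space-separated blocks of ten bases, so it has to be joined before any computation. The following is a minimal sketch of how the blocks might be cleaned up and a GC percentage computed. The helper names are hypothetical, and only the first row of the sequence is used here for brevity, so the printed figure is not the whole-gene GC content reported in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First row of the gene sequence, copied from the record.
first_row = "ATGCAGCATG AGCTACAATC TATTTCTTCA TTGCTTTTTG GTATTGCCAT TGTACTATTG"
seq = clean_sequence(first_row)
print(len(seq), round(gc_percent(seq), 1))
```

Passing all rows of the gene sequence as one multi-line string to `clean_sequence` would give the full 1344 bp sequence for comparison with the listed GC content.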
 
Protein sequence
MQHELQSISS LLFGIAIVLL GSGLLGTLVG VQANQEQFSS TVIGFIQSAF FLGYVLGTFL 
CPLLIKRVGH IRVFATMAAL GSATAMGFAL WVHPLWWVLL RMVLGISVVG LYMVVESWLN
EQSSHHSRGR VFAIYMSITL MALGFSQFLL LIEDNHGFIR FALTAVLFSL ALIPVALTQT
LEPKPISAPR SNLKELYLVS PLGVVGALVA GLASGAFWGM GAVFAQNIGL SVSSTSVFMS
TVIFGGALLL WPVGYLSDRW DRRRVLIMVS FTSVASVLGA ALVLDASTPM LLLLAFLYGG
VSFSVYALAV AHLNDHLKPG EVLEATRGIL LVYGAGSALG PLIAGFCMAV WGPSGLLDYL
AAILALLGLF GLYRTQRSAP IPAEEQGEFV PMIRTSQAVL EMYPEADLEP ELDLALSTDF
EEEAEPESPP DSFSMDWDSP DYEQERK
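
The protein sequence can be reproduced from the gene sequence by translating codon by codon. The sketch below builds the codon table from the NCBI-style amino acid string for translation table 11, which assigns the same amino acids as the standard code. The `translate` helper is illustrative rather than the database's own tooling, and it simply stops at the first stop codon.

```python
from itertools import product

# Codons enumerated in NCBI order (T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a block-formatted DNA string up to the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":  # stop codon
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 24 bases of the gene sequence in the record.
print(translate("ATGCAGCATG AGCTACAATC TATTTCTTCA"))  # MQHELQSISS
print(translate("GATTATGAGC AAGAGAGAAA ATAG"))        # DYEQERK
```

Both fragments match the corresponding ends of the protein sequence listed above.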