Gene Noc_1559 details

Gene Information

Locus tag: Noc_1559
Symbol:
ID: 3705817
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 1732474
End bp: 1733838
Gene Length: 1365 bp
Protein Length: 454 aa
Translation table: 11
GC content: 52%
IMG OID: 637738042
Product: major facilitator transporter
Protein accession: YP_343571
Protein GI: 77165046
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID: [TIGR00883] metabolite-proton symporter
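
The start, end, gene length and protein length fields can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive, and that the gene length includes one terminal stop codon. Both assumptions are consistent with the values in this record but are not stated by it. The function names are hypothetical.

```python
# Values copied from the record above.
START_BP = 1732474
END_BP = 1733838


def gene_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end] (assumed 1-based, inclusive)."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by a CDS assumed to end with exactly one stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length(START_BP, END_BP)
print(length, protein_length(length))  # prints: 1365 454
```

Under these assumptions the computed values agree with the Gene Length (1365 bp) and Protein Length (454 aa) fields.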


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.956191
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGATACTG AGGCTACTGC AGACACGACT GAAGCTCATA TCGTTGACGA ATCGGAGGTT 
AAACGCGCCG TCACCGCAGC CGCCATGGGT AATGCGTTGG AATGGTTCGA TTTTAGCATC
TACAGCTACA CCGCAGCCAC AATAGGGCAT GTGTTCTTTC CCTCTCACAG TAATACCGCC
TCTCTGTTAG CATCCTTCGG TGTGTTCACC CTTGCCTTTG TAGTAAGACC CCTGGGAGGC
TTTTTCTTCG GCCCTTTGGG AGACAAGGTA GGCCGCAACA AAGTGCTGGC ACTAACTATC
ATCTTGATGT CGGTCGCCAC CTTTTGTATT GGGATCATTC CAAGTTACGC GTCGATCGGC
GTTTGGGCGC CCATTGGCCT AATCCTGGCG AGATTGGTGC AGGGCTTCTC TACCGGCGGT
GAATACGGCG GTGCCGCCAC ATTTATTTGT GAATTCTCGC CTGATAACCG GCGTGGCTTT
TTGGGAAGTT GGTTGGAGTT CGGCACGCTA GGCGGCTACA CGCTAGGCGC CGTTCTAGTT
ACCGGCATCT CGATGGTGCT TACCAGTGAA GAATTTTTCA CCTGGGGCTG GCGTATCCCC
TTTCTGATAG CGGGTCCCTT AGGGCTACTC GGACTCTACC TGCGCCTGAA ACTCAAAGAA
AGCCCGGCAT TTAAACAGAT GAAGGAAGAT GCGGAGCAAA AGGATTCCTC CTTTCGGGAA
ATTCTTATCG TTAATCTACG TCTACAGGCG CTCTGCATCG GCTTGGTACT GATACTCAAC
ATCGCCTACT ATACGGTGCT CAGTTACCTG CCAAGCTACC TCACCGAGGT ACTGCATATA
GATGCCTCCC GCTCACTGGT ATTTCTCGTG CTGACGATGT TAGCCATGAT GTGCGTCATC
AATATGGTGG GCAAACTATC AGACCACGTG GGGCGTAAAC CGGTGCTGGT AGGCGCTTGT
ATCGGCTTTA TCATTTTGTC ATACCCAGCA TTTTGGTTAC TGTCACAACA CAGTATCACC
ACCACCGTCA TCGGTTTGGC CATTCTAGGC ACACTTGTGG TAGCGCTTGC GGGTGTCATG
CCGGCTACTT TACCTGCTAT TTTCCCAACC CACATCCGGT ACGGCGGCTT TGCCATTTCC
TACAATATTT CCACCGCTCT GTTTGGCGGC ACTGCCCCCT TGGTTATTAC CTGGTTGATC
GCGACCACCG GTGATAACTT TGTGCCTGCC TACTATCTGA TGCTGGCAGC AGCTATCGCC
ATAGTACCCA TTCTAATCAT TCCCGAAACC GCCGGTAAAC CGATGCTAGG CTCGATGGCG
GTACGCATCC AGATGAACGA TTCAGGCCCG AAAGCACGGA ACTGA
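
The gene sequence above is printed in space-separated groups of 10 bases, so it has to be de-blocked before any computation. The following is a minimal sketch of how a GC fraction like the one in the GC content field could be computed. The helper names are hypothetical, only the first line of the sequence is used as sample input, and the database's exact rounding rule is not known.

```python
def clean_sequence(blocked: str) -> str:
    """Remove all whitespace from a sequence printed in 10-base groups."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, used here only as sample input.
first_line = "ATGGATACTG AGGCTACTGC AGACACGACT GAAGCTCATA TCGTTGACGA ATCGGAGGTT"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 3))
```

Passing the full 1365-base sequence to `clean_sequence` and `gc_fraction` gives the value to compare against the reported 52%.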
 
Protein sequence
MDTEATADTT EAHIVDESEV KRAVTAAAMG NALEWFDFSI YSYTAATIGH VFFPSHSNTA 
SLLASFGVFT LAFVVRPLGG FFFGPLGDKV GRNKVLALTI ILMSVATFCI GIIPSYASIG
VWAPIGLILA RLVQGFSTGG EYGGAATFIC EFSPDNRRGF LGSWLEFGTL GGYTLGAVLV
TGISMVLTSE EFFTWGWRIP FLIAGPLGLL GLYLRLKLKE SPAFKQMKED AEQKDSSFRE
ILIVNLRLQA LCIGLVLILN IAYYTVLSYL PSYLTEVLHI DASRSLVFLV LTMLAMMCVI
NMVGKLSDHV GRKPVLVGAC IGFIILSYPA FWLLSQHSIT TTVIGLAILG TLVVALAGVM
PATLPAIFPT HIRYGGFAIS YNISTALFGG TAPLVITWLI ATTGDNFVPA YYLMLAAAIA
IVPILIIPET AGKPMLGSMA VRIQMNDSGP KARN
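
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below builds the codon map from the standard NCBI amino-acid string, which translation table 11 shares with table 1 (the two differ only in which codons may act as start codons). It does not handle alternative start codons, which is adequate here because this gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# NCBI codon order: bases T, C, A, G at each position, third position fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(blocked_dna: str) -> str:
    """Translate a whitespace-blocked DNA string, stopping at the first stop codon."""
    dna = "".join(blocked_dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGGATACTG AGGCTACTGC AGACACGACT"))  # prints: MDTEATADTT
```

The output matches the first ten residues of the protein sequence. Translating the full gene sequence the same way gives the string to compare against the 454-residue protein.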