Gene Noc_2339 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_2339 
Symbol 
ID3704761 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp2683688 
End bp2684704 
Gene Length1017 bp 
Protein Length338 aa 
Translation table11 
GC content48% 
IMG OID637738822 
ProductUDP-glucose 4-epimerase 
Protein accessionYP_344327 
Protein GI77165802 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1087] UDP-glucose 4-epimerase 
TIGRFAM ID[TIGR01179] UDP-glucose-4-epimerase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000378791 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGAAGA AAGGAATACT TGTCACTGGT GGCGCCGGCT ATATCGGCAG CCATGTGGTC 
CAGCAGCTGA TGGCGACATC TCATCGAGTC ATTGTTCTAG ACAATCTCTC AACAGGTTTT
GCCAATGCGG TTCCCAAGGC TAATCTCGTA ATTGGGGATA CCAAAGACAA GGTATTGGTG
GATACACTTC TGAAAGAGTA TAGCGTAGAC ACCGTGATGC ACTTTGCTGC TTACACCATT
GTCCCGGAAT CTGTTGCCGA TCCTCTTAAA TATTACGCCA ATAATACTTG CCACACGCAC
AATCTCCTAG AGTGCTGCGC AGCAGCAGGG GTAAAGCACT TTATCTTCTC CTCCACGGCA
GCCACCTATG GTATCCCCTC CACGCCCTTG GTTACCGAAG ATACTCCTAC CATTCCTGTA
AACCCTTATG GTACCTCCAA ACTAATGAGC GAATGGATGC TACGTGACTT ATCTCAGGCA
AGCAGTCTTA ATTATGTAAC TCTGCGCTAT TTTAACGTCG CCGGCTCCGA TCCAAATGGG
CATATTGGCC AATCAACCCG TAAGGCCACG TTACTCATTA AAGTGGCCTG TGAGGCCGCA
GTAGGAAAGC GAGAGCAAGT ATATATTTTT GGTACTGATT ACCCGACCCC CGATGGCACA
GGAATCCGTG ATTATATTCA TGTAGAAGAC TTAGCAAACG CTCATATTTT GGCATTGGAT
TATCTTAAAC AGGGTGGAAA ATCAACTACC TTAAACTGTG GCTATGGCCA TGGCTATAGT
GTGCGCGAAG TACTTGACGC AGTCCAACGG GTGCATGGCC GCCCTATTAA AATAGTAAAG
CATCCTCGCC GTCCCGGTGA CCCGCCCCGC TTAGTCGCTG CTGCTCAACA AGTGCGGAAC
GTATTAGGCT GGCAACCAAA ATACGATAAT CTGGATTTTA TCGTCAAAAC TTCGCTAAAC
TGGGAATATA AATTACTGGC ACGCGACCGG CAATCTACGA TAAACGCTAA AAGCTGA
 
Protein sequence
MAKKGILVTG GAGYIGSHVV QQLMATSHRV IVLDNLSTGF ANAVPKANLV IGDTKDKVLV 
DTLLKEYSVD TVMHFAAYTI VPESVADPLK YYANNTCHTH NLLECCAAAG VKHFIFSSTA
ATYGIPSTPL VTEDTPTIPV NPYGTSKLMS EWMLRDLSQA SSLNYVTLRY FNVAGSDPNG
HIGQSTRKAT LLIKVACEAA VGKREQVYIF GTDYPTPDGT GIRDYIHVED LANAHILALD
YLKQGGKSTT LNCGYGHGYS VREVLDAVQR VHGRPIKIVK HPRRPGDPPR LVAAAQQVRN
VLGWQPKYDN LDFIVKTSLN WEYKLLARDR QSTINAKS